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1. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Serial No. 09/941,992 recorded November 16, 2001, 
at Reel 012176 and Frame 0450. 

2. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PR0341 M . There exists one related patent application, U.S. Serial No. 09/990,71 1, 
filed November 14, 2001 (containing claims directed to antibodies to the full-length sequence 
PR0341 polypeptide). This related application is also under final rejection from the same 
Examiner and based upon the same outstanding rejection, therefore appeal of these final 
rejections are being pursued independently and concurrently herewith. 

3. STATUS OF CLAIMS 

Claims 124-126 and 129-131 are pending in this application. 

Claims 1-123 and 128 were canceled and Claim 124 was amended to remove references 
to Figures in the claim. In response to a Final Office Action filed on July 7, 2004, Claim 127 
was canceled and Claim 124 was further amended with the functional recitation "wherein, the 
nucleic acid encoding said polypeptide is amplified in lung cell carcinomas". 

Thus, Claims 124-126 and 129-131 remain pending and under final rejection, wherein the 
final rejection of these claims is being appealed herein. 

A copy of the rejected claims involved in the present Appeal is provided in the Claims 
Appendix. 

4. STATUS OF AMENDMENTS 

A summary of the prosecution history for this case is as follows: 

Previously, in response to a Final Office Action mailed on September 16, 2004, a Notice 

of Appeal was filed on January 12, 2005 and an Appeal Brief was filed on July 26, 2005. An 

Examiner's Answer was mailed on October 12, 2005 which cited new references; hence, a Reply 

Brief, a Petition for Designation as New grounds of Rejection and a request to withdraw finality 

of the rejection was filed under 37 C.F.R. §1.181 on December 12, 2005. The Decision on the 
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Petition granted the Appellants' request to have the finality withdrawn, and was mailed on 
January 30, 2006. Therefore, a Supplemental Response submitting supportive references to 
Appellants' arguments was filed on March 30, 2006. A Final Office Action was mailed on April 
21, 2006 to which a response was mailed September 25, 2006 with additional references and 
affidavits. These references were indicated as fully considered in an Advisory Action (with new 
references cited by the Examiner) mailed September 25, 2006. A Notice of Appeal was filed on 
October 20, 2006 and an Interview summary was also mailed on December 1, 2006, which 
clarified that arguments to the newly cited references within the Advisory Action would be 
acceptable in the Appeal Brief filing. A Statement of Substance of this Interview was filed on 
December 28, 2006. 

No claim amendments have been submitted after the last final rejection of April 21, 2006. 

5. SUMMARY OF CLAIMED SUBJECT MATTER 

Independent Claim 124 is directed to an isolated polypeptide comprising the amino acid 
sequence of a polypeptide referred to in the present application as "PR034L" PR0341 is a cell 
surface polypeptide, which is described as a novel polypeptide having a signal peptide sequence 
extending from about amino acid position 1 to about amino acid position 17 in the sequence of 
SEQ ID NO: 20 and seven transmembrane domains (see page 49, lines 3-8, and for example, 
Example 8 and Figure 12). The encoding PR0341 is shown for the first time in the present 
patent application to be (i) significantly overexpressed (or "upregulated") in human lung cell 
carcinomas as compared to normal, non-cancerous human tissue controls (Example 170). This 
feature is specifically recited in claim 124, and carried by all claims dependent from claim 124. 

In particular, the amino acid sequence of the native "PR0341" polypeptide and the 
nucleic acid sequence encoding this polypeptide (referred to in the present application as 
"DNA26288-1239") are shown in the present specification as SEQ ID NOs: 20 and 19, 
respectively, and in Figures 12 and 11, respectively. Page 288, lines 14-17 of the specification 
provides the description for Figures 12 and 1 1. The cDNA for PR0341 was deposited under 
ATCC accession number 209792. Pending Claims 125-126 and 129-131 depend from Claim 
124. 

A PRO polypeptide sequence lacking the signal peptide (claim 124, part (b)) is described 

in the specification at, for example, page 305, lines 12-22, and page 49, lines 2-3. The 
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preparation of chimeric PRO polypeptides (claims 130 and 131), including those wherein the 
heterologous polypeptide is an epitope tag or an Fc region of an immunoglobulin, is set forth in 
the specification at page 374, lines 24 to page 375, line 9. Examples 140-143 and page 376, line 
12 onwards describe the expression of PRO polypeptides in various host cells, including E. coli, 
mammalian cells, yeast and Baculovirus-infected insect cells. 

Finally, Example 170, in the specification at page 539, line 19, to page 555, line 5, sets 
forth a 'Gene Amplification assay' which shows that the PR0341 gene is amplified in the 
genome of certain human lung cancers (see Table 9A, page 550, third column). The profiles of 
various primary lung tumors used for screening the PRO polypeptide compounds of the 
invention in the gene amplification assay are summarized on Table 8, page 546 of the 
specification. 

6. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1 . Whether Claims 1 24- 1 26 and 1 29- 1 3 1 satisfy the utility requirement under 
35 U.S.C. §101/ §112, first paragraph. 

7. ARGUMENTS 
Summary of the Arguments 

Issue 1 : Utility 

Claims 124-126 and 129-131 stand rejected under 35 U.S.C. §101/§112, first paragraph 
as allegedly lacking either a specific and substantial asserted utility or a well established utility. 

Appellants have submitted that patentable utility of the PR0341 polypeptides is based 
upon the gene amplification data for the gene encoding the PR0341 polypeptide. Since the 
specification clearly discloses that the gene encoding PR0341 showed significant amplification, 
ranging from 2.173 to 2.514 fold in three different lung primary tumors . Appellants have 
submitted, with their Response filed October 24, 2003, the Declaration of Dr. Audrey Goddard, 
which explains that a gene identified as being amplified at least 2-fold by the disclosed gene 
amplification assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer , for monitoring cancer development and/or for measuring the efficacy of 
cancer therapy. Therefore, one of ordinary skill would find it credible that the claimed PR0341 
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polypeptides have utility as markers for the diagnosis of lung tumors . Appellants further 
submitted a vast number of references from the art as well as Declarations, written by experts in 
the field of oncology, during prosecution (for example, see Items 2-4: Declarations by Dr. 
Ashkenazi and Dr. Polakis (I and II); Items 8-1 1 and Items 20-151 of Evidence Appendix), that 
collectively show that, in general, if a gene is amplified in cancer, it is more likely than not that 
the encoded protein will be expressed at an elevated level. For instance, the articles by Orntoft et 
aL, Hyman et ai, and Pollack et al (made of record in Appellants' Response filed July 7, 2004) 
collectively teach that in general gene amplification increases mRNA expression . Second, the 
Declaration of Dr. Paul Polakis (made of record in Appellants' Response filed July 7, 2004), 
principal investigator of the Tumor Antigen Project of Genentech, Inc., the assignee of the 
present application, shows that, in general there is a correlation between mRNA levels and 
polypeptide levels . 

However, the Examiner maintains on page 4 of the Advisory Action mailed October 25, 
2006 that the gene amplification data "have no bearing on the utility of the claimed PR0341 
polypeptides. In order for PR0341 polypeptides to be overexpressed in tumors, amplified 
genomic DNA would have to correlate with increased mRNA levels, Avhich in turn would have 
to correlate with increased polypeptide levels. No data regarding PR0341 mRNA or PR0341 
polypeptide levels in lung tumors have been brought forth on the record. The art discloses that a 
correlation between genomic DNA levels and mRNA levels cannot be presumed, nor can any 
correlation between mRNA levels and polypeptide levels" (see Page 4 of Advisory Action). The 
Examiner, on the other hand, has cited several references previously (like Pennica et al, 
Konopka et al, Hu et al: see Items in Evidence Appendix list) and additional references (see 
Items of Evidence Appendix list) to show that there is no utility for PR0341 polypeptides based 
on gene amplification data. 

In contrast, Appellants have submitted ample evidence (more than 100 references) to 

show that, in general, if a gene is amplified in cancer, it is more likely than not that the encoded 

protein will be expressed at an elevated level. For instance, the articles by Orntoft et al, Hyman 

et al, and Pollack et al (made of record in Appellants' Response filed July 7, 2004) collectively 

teach that in general gene amplification increases mRNA expression . Second, the Declaration 

of Dr. Paul Polakis (made of record in Appellants' Response filed July 7, 2004), principal 

investigator of the Tumor Antigen Project of Genentech, Inc., the assignee of the present 
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application, shows that, in general; there is a correlation between mRNA levels and polypeptide 
levels . Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e., that it is more likely than not informative of 
the protein level). 

Appellants emphasize that the utility standard is not absolute certainty . Appellants only 
need to show that it is more likely than not that a DNA/mRNA/protein correlation exists in 
order to meet the utility standard. Therefore, even if a reasonable DNA/mRNA/protein 
correlation is not found in some instances, the utility standard can still be met in the instant 
application because Appellants have provided an overwhelming amount of evidence in the art, 
supporting a general DNA/mRNA/protein correlation. 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is a correlation between DNA, mRNA, 
and polypeptide levels, these instances are exceptions rather than the rule . In the majority of 
amplified genes , as exemplified by vast number of references submitted by Appellants, and the 
Polakis Declaration, the teachings in the art overwhelmingly show that gene amplification 
influences gene expression at the mRNA and protein levels . Therefore, one of skill in the art 
would reasonably expect in this instance, based on the amplification data for the PR0341 gene, 
that the PR0341 polypeptide is concomitantly overexpressed. Thus, the claimed PR0341 
polypeptides have utility in the diagnosis of cancer. 

Appellants further submit that even if there is no correlation between gene amplification 
and increased mRNA/protein expression, (which Appellants expressly do not concede), a 
polypeptide encoded by a gene that is amplified in cancer would still have a specific, substantial, . 
and credible utility. Appellants submit that, as evidenced by the Ashkenazi Declaration (made of 
record in Appellants' Response filed October 24, 2003) and the teachings of Hanna and Mornin 
(made of record in Appellants 1 Response filed July 7, 2004), simultaneous testing of gene 
amplification and gene product over-expression enables more accurate tumor classification , even 
if the gene-product, the protein, is not over-expressed. This leads to better determination of a 
suitable therapy for the tumor, as demonstrated by the real-world example of the breast cancer 
marker HER-2/neu . 
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Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed PR0341 polypeptides. 

Further, since PR0341 polypeptides have utility in the diagnosis of cancer, one of skill in 
the art would know exactly how to use the claimed polypeptides for diagnosis of cancer, without 
any undue experimentation. Appellants 5 add that the specification provides ample, detailed 
guidance to allow the skilled artisan to identify and make polypeptides of PR0341 (SEQ ED 
NO:20). Accordingly, one of ordinary skill in the art would understand how to make and use the 
recited polypeptide variants without any undue experimentation. 

Response to Rejections 

Issue 1. Claims 124-126 and 129-131 are supported by a credible, specific and substantial 
asserted utility, and thus meet the utility requirement of 35 U.S.C. §101/§1 12. first paragraph 
The sole basis for the Examiner's rejection of claim 124-126 and 129-131 under this 
section is that the data presented in Example 170 of the present specification is allegedly 
insufficient under the present legal standards to establish a patentable utility under 35 U.S.C. 
§101 for the presently claimed subject matter. Appellants strongly disagree and, therefore, 
respectfully traverse the rejection. 

A. The Legal Standard For Utility Under 35 U.S.C. $101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, 1 the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent Applicant disclose a "substantial utility" for his or 



1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 
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her invention, i.e., a utility "where specific benefit exists in currently available form." The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 

commerce rather than the realm of philosophy." 3 

4 

Later, in Nelson v. Bowler, the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The Court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 

In Cross v. lizuka, 6 the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "w vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 

7 

i.e., there is a reasonable correlation there between." The Court perceived, "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 



2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. lizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 
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The case law has also clearly established that Appellants' statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 
burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Applicant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope. * 

Compliance with 35 U.S.C. §101 is a question of fact. 13 The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the Applicant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Applicant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 

("Utility Guidelines"), 15 which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 



9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 

Ibid. 

U In re Langer, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Mies, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 R2d 974, 144 
USPQ 35 1 (1965); In re Sichert, 566 F.2d 1 154, 1 159, 196 USPQ 209, 212-13 (C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert denied, 469 
US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 

15 66 Fed. Reg. 1092(2001). 
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a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions,- however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 
regard to defining a 'substantial 5 'utility.'" 16 Indeed, the Guidelines for Examination of 
Applications for Compliance With the Utility Requirement/ 7 gives the following instruction to 
patent examiners: "If the Applicant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. The Data and Documentary Evidence Supporting a Patentable Utility 

The data presented by the Appellants in the present application and which underlies the 
current dispute is presented in Example 170 starting on page 539 of the specification. Example 
170 describes the results obtained using a very well-known and routinely employed polymerase 
chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred to herein as the gene 
amplification assay. This assay allows one to quantitatively measure the level of gene 
amplification in a given sample, say, a tumor extract, or a cell line. It was well known in the art 
at the time the invention was made that gene amplification is an essential mechanism for 
oncogene activation. Appellants isolated genomic DNA from a variety of primary cancers and 
cancer cell lines that are listed in Table 9A (pages 539 onwards of the specification), including 
primary lung cancers of the type and stage indicated in Table 8 (page 546). The tumor samples 
were tested in triplicates with Taqman™ primers and with internal controls, beta-actin and 
GADPH in order to quantitatively compare DNA levels between samples (page 548, lines 33- 
34). As a negative control, DNA was isolated from the cells of ten normal healthy individuals, 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 11(B)(1). 
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which was pooled and used as a control (page 539, lines 27-29) and also, no-template controls 

(page 548, lines 33-34). The results of TaqMan™ PGR are reported in ACt units, as explained in 

the passage on page 539, lines 37-39. One unit corresponds to one PCR cycle or approximately a 

2-fold amplification, relative to control, two units correspond to 4-fold, 3 units to 8-fold 

amplification and so on . Using this PCR-based assay, Appellants showed that the gene 

encoding for PR0341 was significantly amplified , that is, it showed approximately 1.12-1.33 

ACt units which corresponds to 2 1 12 -2 1 33 - fold amplification or 2.173 fold to 2.514-fold 

amplification in three lung tumors. 

In support of their showing that these gene amplification values are significant, 

Appellants submitted, in their Response filed October 24, 2003, a Declaration by Dr. Audrey 

Goddard. Appellants particularly draw the Board's attention to page 3 of the Goddard 

Declaration which clearly states that: 

It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non- tumor) sample is 
significant and useful in that the detected increase in gene copy number in the tumor 
sample relative to the normal sample serves as a basis for using relative gene copy 
number as quantitated by the TaqMan PCR technique as a diagnostic marker for the 
presence or absence of tumor in a tissue sample of unknown pathology. Accordingly, a 
gene identified as being amplified at least 2-fold by the quantitative TaqMan PCR assay 
in a tumor sample relative to a normal sample is useful as a marker for the diagnosis of 
cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. (Emphasis added). 

Accordingly, the 2.173-fold to 2.514-fold amplification observed for PR0341 in the three lung 
tumors would be considered significant and credible by one skilled in the art, based upon the 
facts disclosed in the Goddard Declaration. 

It is also well known that gene amplification occurs in most solid tumors, which includes 
lung carcinomas, and is generally associated with poor prognosis. Therefore, the PR0341 gene 
becomes an important diagnostic marker to identify such malignant lung carcinomas, even when 
the lung malignancy associated with PRQ341 molecule is a rare occurrence . Accordingly, the 
present specification clearly discloses enough evidence that the gene encoding the PR0341 
polypeptide is significantly amplified in certain types of lung carcinoma tumors and is therefore, 
a valuable diagnostic marker for identifying certain types of lung carcinomas. 
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In addition, Example 170 in the specification further discloses, "Amplification is 
associated with overexpression of the gene product, indicating that the polypeptides are useful 
targets for therapeutic intervention in certain cancers such as colon, lung, breast and other 
cancers and diagnostic determination of the presence of those cancers" (emphasis added). 
Besides, Appellants have submitted ample evidence (discussed below) to show that, in general, if 
a gene is amplified in cancer, it is more likely than not that the encoded protein will be expressed 
at an elevated level as well. 

First, the articles by Orntoft et aL, Hyman et al, and Pollack et al (made of record in 
Appellants' Response filed July 7, 2004) collectively teach that in general for most genes, DNA 
amplification increases mRNA expression . The results presented by Orntoft et al, Hyman et al , 
and Pollack et al are based upon wide ranging analyses of a large number of tumor associated 
genes. Orntoft et al studied transcript levels of 5600 genes in malignant bladder cancers, many 
of which were linked to the gain or loss of chromosomal material, and found that in general (18 
of 23 cases) chromosomal areas with more than 2-fold gain of DNA showed a corresponding 
increase in mRNA transcripts. Hyman et al compared DNA copy numbers and mRNA 
expression of over 12,000 genes in breast cancer tumors and cell lines, and found that there was 
evidence of a prominent global influence of copy number changes on gene expression levels. In 
Pollack et al, the authors profiled DNA copy number alteration across 6,691 mapped human 
genes in 44 predominantly advanced primary breast tumors and 10 breast cancer cell lines, and 
found that on average, a 2-fold change in DNA copy number was associated with a 
corresponding 1.5-fold change in mRNA levels. In summary, the evidence supports the 
Appellants' position that gene amplification is more likely than not predictive of increased 
mRNA and polypeptide levels. 

Second, the Declaration of Dr. Paul Polakis (made of record in Appellants' Response 

filed July 7, 2004), principal investigator of the Tumor Antigen Project of Genentech, Inc., the 

assignee of the present application, explains that in the course of Dr. Polakis 1 research using 

microarray analysis, he and his co-workers identified approximately 200 gene transcripts that are 

present in human tumor cells at significantly higher levels than in corresponding normal human 

cells. Subsequently, antibodies binding to about 30 of these tumor antigens were prepared, and 

mRNA and protein levels were compared. In approximately 80% of the cases , the researchers 

found that increases in the level of a particular mRNA correlated with changes in the level of 
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protein expressed from that mRNA when human tumor cells are compared with their 
corresponding normal cells . Therefore, Dr. Polakis' research, which is referenced in his 
Declaration,, shows that, in general there is a correlation between increased mRNA and 
polypeptide levels . 

Third, the second Declaration by Dr. Polakis (Polakis II) presented evidentiary data in 
Exhibit B. Exhibit B of the Declaration identified 28 gene transcripts out of 31 gene transcripts 
(i.e., greater than 90%) that showed good correlation between tumor mRNA and tumor protein 
levels. As Dr. Polakis' Declaration (Polakis II) says "[a]s such, in the cases where we have been 
able to quantitatively measure both (i) mRNA and (ii) protein levels in both (i) tumor tissue and 
(ii) normal tissue, we have observed that in the vast majority of cases, there is a very strong 
correlation between increases in mRNA expression and increases in the level of protein encoded 
by that mRNA." Accordingly, Dr. Polakis has provided the facts to enable the Examiner to draw 
independent conclusions regarding protein data. 

Both Polakis Declarations (Polakis I and II) were further supported by the teachings in 
more than 100 references from the art established that the accepted understanding in the art is 
that there is a reasonable correlation between changes in gene expression and the level of the 
encoded protein. Amongst these, there were references like Orntoft et al., that supported the 
assertion that changes in DNA levels generally lead to corresponding changes in the level of the 
encoded polypeptide. 

In summary, Appellants submitted more than 100 references, in addition to the 
declarations and references already of record, to support Appellants' asserted utility. These 
references support the assertion that in general, a change in DNA levels for a particular gene 
leads to a corresponding change in the level of expression of the encoded protein. As Appellants 
have previously acknowledged, the correlation between changes in DNA levels and protein 
levels is not exact, and there are exceptions (see, e.g., Pennica, Konopka et al). However, 
Appellants remind the PTO that the asserted utility does not have to be established to a statistical 
certainty, or beyond a reasonable doubt. See M.P.E.P. at §2107.02, part VII (2004). Therefore, 
the fact that there are exceptions to the correlation between changes in mRNA and changes in 
protein does not provide a proper basis for rejecting Appellants' asserted utility. Appellants 
submit that considering the evidence as a whole, with the overwhelming majority of the evidence 
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supporting Appellants' asserted utility, a person of skill in the art would conclude that 
Appellants' asserted utility is " more likely than not true ." Id. 

Taken together, all of the submitted evidence supports the Appellants' position that, 
increased gene amplification levels, more likely than not, predict increased mRNA and 
polypeptide levels, which clearly meets the utility standards described above. Hence, one of skill 
in the art would reasonably expect that, based on the gene amplification data of the PR0341 
gene, the PR0341 polypeptide is concomitantly overexpressed in the lung tumors studied as 
well. 

Appellants further submit that, even if there were no correlation between gene 
amplification and increased mRNA/protein expression, (which Appellants expressly do not 
concede), a polypeptide encoded by an amplified gene in cancer would still have a specific, 
substantial, and credible utility as explained below. As the Declaration of Dr. Avi Ashkenazi 
(submitted with Appellants 1 Response filed October 24, 2003) explains: 

"even when amplification of a cancer marker gene does not result in significant over- 
expression of the corresponding gene product, this very absence of gene product over- 
expression still provides significant information for cancer diagnosis and treatment." 

Additional supporting evidence for such a utility is presented in a real-world example in an 
article by Hanna and Mornin (submitted with Appellants 1 Response filed July 7, 2004), which 
demonstrates a use for the breast cancer marker HER-2/neu. Hanna and Mornin teach that 
diagnosis of breast cancer includes testing both the amplification of the HER-2/neu gene (by 
FISH), as well as, the over-expression of the HER-2/neu gene product (by IHC). Even when the 
protein is not over-expressed, the assay relying on both tests leads to a more accurate 
classification of the cancer and a more effective treatment of it . Thus, as evidenced by the 
Ashkenazi Declaration and the teachings of Hanna and Mornin, one skilled in the art would 
appreciate that simultaneous testing of gene amplification and gene product over-expression 
enables more accurate tumor classification, even if the gene-product, the protein, were not over- 
expressed. This leads to better determination of a suitable therapy for the tumor. Such testing is 
for the purpose of characterizing not the PR0341 polypeptide, but the tumors in which the gene 
encoding PR0341 is amplified. Therefore, the PR0341 polypeptide is also useful in tumor 
categorization, the results of which become an important tool in the hands of a physician 
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enabling the selection of a treatment modality that holds the most promise for the successful 
treatment of a patient. 

Based on the gene amplification data presented for PR0341 in Example 170 of the 
specification, and all the submitted evidence, there is ample support for the Appellants 1 position 
that increased gene amplification levels, more likely than not, predict increased mRNA and 
polypeptide levels. One of skill in the art would therefore reasonably expect, based on: (a) the 
gene amplification data for the PR0341 gene, (b) the supportive evidence in the Declarations 
submitted, and, (c) the supportive articles presented by the Appellants which were available in 
the art at the time of filing of the instant application, that the PR0341 polypeptide is most likely 
to be concomitantly, overexpressed in certain lung tumors, just like the PR0341 gene, and is 
therefore useful as a tumor marker for these types of lung cancers. Even in the event that the 
PR0341 polypeptide were found not to be overexpressed in the lung tumors where the PR0341 
gene were amplified, (a position expressly not conceded to), the PR0341 polypeptide is still 
useful as a marker in tumor categorization and becomes an useful tool, enabling the physician to 
decipher appropriate lines of treatment for the cancer patient, which is a real-life utility. 

C. A prima facie case of lack of utility has not been established 

Contrary to the Appellants assertion of utility, however, the Examiner alleges that the 
gene amplification results presented in Example 170 does not render the presently claimed 
polypeptides patentably useful, and, finds the declaratory evidence presented in this case, for 
what Appellants consider legally inappropriate reasons, "non-persuasive". Appellants 
respectfully submit, however, that upon application of the proper legal standards described 
above, the appropriate conclusion is that the present application does, in fact, disclose at least 
one patentable utility for the claimed PR0341 polypeptide. 

The Examiner alleges that "it is more likely than not that the claimed PR0341 
polypeptide and the antibodies are not useful as cancer diagnostic agents." (Pages 6-7 of the 
instant Advisory Action). The Examiner has relied on the teachings of Chen et al, Haynes et al, 
Hu et al, Lian et al, Fessler et al, and now cites new references Nagaraja et al, Waghray et al, 
Sagynaliev et al, Lilley et al, Madoz-Gurpide et al. 9 Celis et al, Wildsmith et al, King et al to 
support her position. The Examiner also discusses Appellants' cited references Alberts and 
Lewin, Godbout and Li et al 
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Appellants have already discussed references Chen etal, Haynes et al, Hu et al, et al, 
Haynes et al, Hu et al, Lian et al, Fessler et al, in great detail in their previous responses 
(Appeal brief, Reply Brief, Supplemental response, etc.), and these arguments are hereby 
incorporated by reference for brevity. Appellants' maintain their position that these references 
do not support the Examiner's arguments. The references discussed by the Examiner for the first 
time in the instant Advisory Action are discussed below. 

Nagaraia et al* Waghrav et al. and Sagynaliev et al 

The Examiner asserts that "[c]omprehensive studies where significantly large numbers of 
transcripts and proteins were examined report that increases in mRNA and protein samples are 
not correlated." (Page 12 of the Advisory Action). 

The Examiner cites Nagaraja et al as allegedly teaching that "the proteomic profiles 
indicated altered abundance of fewer proteins as compared to transcript profiles." (Page 12 of 
the Advisory Action). 

Appellants respectfully submit that the fact that many more transcripts than proteins were 
found to be differentially expressed does not mean that most mRNA changes did not result in 
correlating protein changes, but merely reflects the fact that expression levels were only 
measured at all for many fewer proteins than transcripts . In particular, the total number of 
proteins whose expression levels could be visualized on silver-stained gels was only about 300 
(page 2332, col. 1), as compared to the approximately 14,500 genes on the microarray chips for 
which mRNA levels were measured (page 2336, col. 1). Since the expression levels of so many 
fewer proteins than transcripts were measured, it is hardly surprising that a smaller absolute 
number of proteins than mRNAs were found to be overexpressed, because the protein products 
of most of the overexpressed mRNAs would not have been among the small number of proteins 
identified on the gels. 

The Examiner next cites Waghray et al, indicating that "for most of the proteins 
identified, there was no appreciable concordant change at the RNA level," and that "[t]he change 
in intensity for most of the affected proteins identified could not be predicted based on the level 
of the corresponding RNA." (Page 12-13 of the Advisory Action). Appellants reiterate that they 
need only show that there is a correlation between mRNA and protein levels, such that mRNA 

16 

Appeal Brief 
Application Serial No. 09/941,992 
Attorney's Docket No. 39780-2730 P1C1 



overexpression generally predict protein overexpression. A showing that mRNA levels can be 
used to "accurately predict" the precise levels of protein expression is not required . 

Appellants also emphasize that Appellants are asserting that a measurable change in 
mRNA level generally leads to a corresponding change in the level of protein expression, not 
that changes in protein level can be used to predict changes in mRNA level. Waghrav et al did 
not take genes which showed significant mRNA changes and check the corresponding protein 
levels . Instead, the authors looked at a small and unrepresentative number of proteins, and 
checked the corresponding mRNA levels. Waghray et al acknowledge that only "[a] relatively 
small set of genes could be analyzed at the protein level, largely due to the limited sensitivity of 
2-D PAGE" (page 1337, col. 1). In particular, while the authors examined the expression levels 
ofl6,570 genes (page 1329, col. 2), they were able to measure the expression levels of only 1031 
proteins (page 1333, col. 2). Waghray et al does not teach that changes in mRNA expression 
were not correlated with changes in expression of the corresponding protein. All Waghray et al 
state is that "for most of the proteins identified, there was no appreciable concordant change at 
the mRNA level" (page 1337, col. 2). This statement is not relevant to Appellants' assertion of 
utility, since Appellants are not asserting that changes in mRNA levels are the only cause of 
changes in protein levels. Waghray et al do not contradict Appellants' assertion that changes in 
mRNA expression, in general, correspond to changes in expression of the corresponding protein. 

Lastly, the Examiner cites Sagynaliev et al, as allegedly teaching that "it is also difficult 
to reproduce transcriptomics results with proteomics tools." In particular, the Examiner notes 
that according to Sagynaliev et al, of 982 genes found to be differentially expressed in human 
CRC, only 177 (18%) have been confirmed using proteomics technologies. (Page 13 of the 
Advisory Action). 

The Sagynaliev et al reference, titled " Web-based data warehouse on gene expression in 

human colorectal cancer" (emphasis added), drew conclusions based upon a literature survey of gene 

expression data published in human CRC , and not from experimental data. While a literature survey 

can be a useful tool to assist researchers, the results may greatly over-represent or under-represent 

certain genes, and thus the conclusions may not be generally applicable. In particular, Appellants 

note that, as evidenced by Nagaraja et al, and Waghray et al, discussed above, the number of 

mRNAs examined in transcriptomics studies is typically much larger than the number of proteins 

examined in corresponding proteomics studies , due to the difficulties in detecting and resolving 
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more than a small minority of all expressed proteins on 2D gels. Thus the fact that only 18% of all 
genes found to be differentially expressed in human CRC have been confirmed using proteomics 
technologies does not mean that the corresponding proteins are not also differentially expressed, but 
is most likely due to the fact that the corresponding proteins were not identified on 2D gels, and thus 
their expression levels remain unknown. 

The authors of Sagynaliev et al acknowledge the many technical problems in finding 
proteomic data for CRC that can be matched to transcriptomic data to see if the two correlate. The 
authors state that "results have been obtained using heterogeneous samples in particular cell lines, 
whole tissue biopsies, and epithelial cells purified from surgical specimens." However, "Results 
obtained in cell lines do not allow accurate comparison between normal and cancer cells, and the 
presence/absence of proteins of interest has to be confirmed in biopsies." (Page 3072, left column.) 
In particular, the authors specifically note that "only a single study [1] provided differential display 
protein expression data obtained in the human patient, using whole tissue biopsy." (Page 3068, left 
column, second paragraph; see also, Table 2.) 

Appellants further note that Table 2 shows that 6 out of 8 published proteomics studies were 
done using 2-D PAGE. However, the authors state that "2-D PAGE or 2-D DIGE have well-known 
technological limitations . . . even under well-defined experimental conditions, 2-D PAGE parallel 
analysis of paired CRC samples is hampered by a significant variability." (Page 3077, left column, 
third paragraph.) Therefore, Appellants respectfully submit that it is well known in the art that there 
are problems associated with selecting only those proteins detectable by 2D gels. 

Lillev et al. King et al and Wildsmith et al 

The Examiner cites Lilley et al to show that "the extrapolation that changes in transcript 
level will also result in corresponding changes in protein amount or activity cannot always be 
made". (Page 14 of the Advisory Action). 

Appellants repeat that it is not a legal requirement for utility, to establish a necessary 
correlation between an increase in the mRNA level and protein expression levels, or to show that 
changes in transcript level should always result in corresponding changes in protein amount or 
activity. As discussed in the previously filed Responses and Preliminary Amendment, the 
evidentiary standard to be used throughout ex parte examination of a patent application is a 
preponderance of the totality of the evidence under consideration and more likely than not 
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standard . Accordingly, the question is not whether a correlation between an increase in mRNA 
and protein expression levels always exists, rather if it is more likely than not that a person of 
ordinary skill in the pertinent art would recognize such a positive correlation. Nowhere in the 
Lilley paper does the author suggest that it is more likely than not that altered mRNA levels does 
not correlate with altered protein levels. On the contrary, the statement that "changes in 
transcript level will also result in corresponding changes in protein amount or activity cannot 
always be made" implies that the mRNA/protein correlation exits in most cases. 

The Examiner cites King et al to show that "mRNA levels do not necessarily correlates 
with protein levels". (Page 14 of the Advisory Action). 

First of all, Appellants note that the instant application is not directed to the microarray 
assay as in King et al. As discussed above, the utility standard is not absolute certainty. 
Therefore, Appellants should not be required to show a "necessary correlation" between mRNA 
and protein levels in order to establish a patentable utility. King never indicates that it is more 
likely than not that a general correlation between the mRNA and protein levels for a gene does 
not exist, this paper alone does not suffice to establish a prima facie showing of lack of utility. 

Appellants also note that the author discussed numerous advantages of the microarray 
technology, which offers tremendous advantages in the study of human diseases. For instance, 
on page 2287, the author states that "microarrays can be expected to prove extremely valuable as 
tools for the study of the generic basis of complex diseases. The ability to measure expression 

profiles across entire genomes provides a level of information not previously attainable 

Microarrays make it possible to investigate differential gene expression in normal vs. diseased 
tissue, in treated vs. non-treated tissue, and in different stages during the natural course of the 
disease, all on a genomic scale. Gene expression profiles may help to unlock the molecular basis 

of phenotype, response to treatment, and heterogeneity of disease " Therefore, if anything, 

the King reference supports the use of the microarray in the diagnosis of human diseases, which 
silently assumes that, most probably, increases in mRNA levels correlate well with increases in 
protein levels which in turn impacts disease. 

Similarly, Wildsmith never indicated that it is more likely than not that a general 

correlation between the mRNA and protein levels for a gene does not exist. Therefore, this paper 

is not sufficient to establish a prima facie showing of lack of utility. In fact, the Wildsmith 

paper discusses examples of a number of successes of microarray applications in the detection of 
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human diseases (see Page 284). For instance, the author has pointed out that "one area of rapid 
progress using microarray technology is the increased understanding of cancer. Molecular 
pathologies are subgroup ing cancers of tissues such as blood, skin, and breast, based on 
differential gene expression patterns. For example, within a small group of breast cancer tissue 
samples, Perou et al distinguished two broad subgroups representing those expressing or 
alternatively lacking expression of the oestrogen receptor- a- gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared with the previous 
methods of gene amplification. Another example of the impact of this technology is in the 
identification of two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran et 
al, 2001). Microarray technology has also accelerated the understanding of the molecular events 
surrounding pulmonary fibrosis. Specially, two distinct clusters of genes associated with 
imflammation and fibrosis have been identified in a disease where, for years, the pathogenesis 
and treatment have remained unknown (Katsuma et al, 2001)." 

Therefore, collectively, the references Lilley et al, Wildsmith et aL 9 and King et al 
references cited by the Examiner show that the art indicates that, generally, if a mRNA is 
overexpressed in cancer, it is more likely than not that the encoded protein will also be expressed 
at an elevated level. 

Madoz Gurpide et al 

Madoz-Gurpide et al explains that mRNA expression alone does not provide 
information regarding the "activation state, post-translational modification or localization of 
corresponding proteins" (emphasis added; page 168, col. 1). That is, Madoz-Gurpide et al 
explain that mechanisms are not apparent from mRNA expression alone. Madoz Gurpide et al 
further state that, it is "unclear" how well the reported RNA levels correlate with protein levels. 
In support of this assertion, the authors cite only a single reference, namely, the Chen et al 
discussed above. Madoz Gurpide et al also acknowledge that the DNA microarray studies, such 
as those carried out by Beer et al (specifically cited by the authors at page 52), "justify the use 
of this technology for uncovering patterns of gene expression that are clinically 
informative" (emphasis added; page 53). Thus, while Madoz-Gurpide et al note that it is 
"more difficult to develop an understanding of disease at a mechanistic level using DNA 
microarrays," (emphasis added; page 53), Appellants respectfully point out that that 
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"understanding of a disease at the mechanistic level" is not relevant to Appellants' assertions of 
utility, as discussed above. Accordingly, a prima facie case cannot be made based on the 
teachings within the Madoz-Gurpide et al reference. 

Accordingly, a prima facie case cannot be made based on the teachings within the Celis 
et al. reference. In fact, Appellants submit that, Celis et al support the Appellants contention 
that it is more likely than not that changes in mRNA levels reflect changes in protein levels, in 
general. 

The Patent Office has failed to meet its initial burden of proof that Appellants' claims of 
utility are not substantial or credible. The arguments presented by the Examiner in combination 
with the Chen et al, Gygi et al, Haynes et al, Hu et al, et al, Madoz-Gurpide et al, Celis et 
al, Lee et al, King et al, Lian et al, Fessler et al, Nagaraja et al, Sagynaliev et al, and 
Waghray et al, do not provide sufficient reasons to doubt the statements by Appellants that 
PR0341 has utility. As discussed above, the law does not require the existence of a "necessary" 
correlation between DNA/mRNA and protein levels. Nor does the law require that protein levels 
be "accurately predicted." According to the authors themselves, the data in the above cited 
references confirm that there is a general trend between protein expression and transcript levels 
and DNA levels, which meets the "more likely than not standard" and show that a positive 
correlation exists between mRNA and protein. In fact, contrary to what the Examiner contends, 
the art indicates that, if a gene is overexpressed in cancer, it is more likely than not that the 
encoded protein will also be expressed at an elevated level. 

Alberts and Lewin etal 

The Examiner acknowledges that "the teachings of Alberts and Lewin (which) disclose 
that initiation of transcription is a common point for a cell to regulate the gene expression" but 
asserts that "it is not the only means of regulating gene expression". (Page 9 of the instant 
Advisory Action). 

Appellants respectfully disagree and submit that the utility standard is not absolute 
certainty. Therefore, Appellants do not need to establish that the production of RNA is 
inevitably equated with production of protein in order to meet the utility standard. Instead, as 
long as it is more likely than not that a change of the transcription level of a gene gives rise to a 
change in translation level of a gene, the utility standard is met. 
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Meric et ah 



The Examiner asserts that Meric teaches that "gene expression is quite complicated and is 
also regulated at the level of mRNA stability, mRNA translation, and protein stability" (Page 10 
of the instant Advisory Action). 

Appellants respectfully submit that Meric simply summarizes the translation regulation of 
cancer cells. Meric indicates that translation initiation is regulated in response to nutrient 
availability and mitogenic stimulation and is coupled with cell cycle progression and cell growth. 
Meric further discusses that alteration in translation control occur in cancer. For example, 
variant mRNA sequences can alter the translational efficiency of individual mRNA molecule. 
(See Abstract). Meric further teaches that the changes of the translational efficiency of a mRNA 
transcript depend on the mutation of a specific mRNA sequence. (Page 973, column 2 to 
page 91556, column 1). Meric never suggest that the translation of a cancer gene is suppressed 
in cancer in general and therefore, an increased mRNA levels will not yield an increased protein 
levels. To the contrary, Meric teaches that the translation efficiency of a number of cancer genes 
is enhanced in cancer cells compared to its normal counterpart. For instance, in patient with 
multiple myeloma, a C-T mutation in the c-myc IRES was identified and found to cause an 
enhanced initiation of translation. (Page 91556, column 1). Therefore, the level of proteins 
encoded by these genes increases in cancer cells at an even higher magnitude than the mRNA 
level. 

Appellants emphasize that it is not a legal requirement to establish an absolute correlation 
between an increase in the mRNA level and protein expression levels that would correlate to the 
disease state nor is it imperative to find evidence that protein levels can be accurately predicted. 
Therefore, the Examiner has misinterpreted the teaching of Meric and applied improperly high 
legal standard. Therefore, the Examiner has failed to establish a prima facie showing of lack of 
utility in this instance. 

Orntoft et aL 

The Examiner asserts that the "Orntoft et al. do not appear to look at gene amplification, 
mRNA levels and polypeptide levels from a single gene at a time." (Page 1 1 of the instant 
Advisory Action). 
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Orntoft et aL 9 looked at the correlation between mRNA levels and protein expression 
levels for individual genes . Orntoft et al clearly explain that "[i]n general there was a highly 
significant correlation (p<0.005) between mRNA and protein alterations. Only one gene [of 
the 40 examined] showed disagreement between transcript alteration and protein alteration." 
(Page 42, col. 2; Emphasis added). Clearly, a correlation in 39 of 40 genes examined supports 
Appellants' assertion that changes in mRNA level generally lead to corresponding changes in 
protein level . 

Celis et al. 

Like Madoz-Gurpide et al, Celis et al discuss that mechanisms are not apparent from 
mRNA expression alone. Celis et al note that "proteomics addresses problems that cannot be 
approached by DNA analysis, namely, relative abundance of the protein product, post- 
translational modification, subcellular localization, turnover, interaction with other proteins as 
well as functional aspects" (page 6, col. 2). However, in their discussion, Celis et al cite Orntoft 
et al (copy enclosed in Evidence List as Item 109) and note that "in most cases there was a good 
correlation between transcript and protein levels." Celis et al further explain that those few 
cases which showed apparent discrepancies may have been due to other causes, such as post- 
transcriptional processing or degradation of the protein, or the choice of methods used to assess 
protein expression levels. Celis et al further note that the observation that there is often more 
change in mRNAs as compared to the proteins may be due to the fact that current technologies 
detect mainly high abundance proteins, while most of the changes affecting protein levels may 
involve low abundance proteins. Thus, the correlation between mRNA and protein levels may 
be even higher than typically observed, given these factors. Celis et al explain that proteomics is 
useful in combination with arrays "for the entire process of drug development and evaluation." 
(page 6; col. 1). Appellants further submit that significant correlations between gene and protein 
expression are most likely to be observed for genes associated with cancer, since as Celis et al 
note, "transformation resulted in the abnormal expression of normal genes, rather than in the 
expression of new ones" (page 11, col. 1). Accordingly, alterations in gene amplification or 
expression are more likely to be associated with altered protein expression in the case of cancer 
than in other cases where DNA microarrays are used because, as explained by Celis et al, the 
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alterations in expression levels of certain normal proteins are part of the process that leads to 
cancer. 

Godbout et aL and Li et aL 

The Examiner asserts that Godbout et aL teaches that "a number of studies suggest that 
co-amplified genes are only overexpressed if they provide a selective advantage to the cells in 
which they are amplified." (Page 16 of the Advisory Action). Appellants respectfully submit 
that the passage cited by the Examiner is based upon two references from 1987 and 1992. In 
contrast, Appellants have made of record three more recent references, published in 2002, by 
Orntoft et aL, Hyman et aL, and Pollack et al, (made of record in Appellants' Response filed July 
7, 2004), which collectively teach that in general gene amplification increases mRNA 
expression . Appellants submit that these more recent references must be acknowledged as more 
accurately reflecting the state of the art regarding the correlation between gene amplification and 
transcript expression than the references cited by Godbout et al. 

The Examiner also cites Li et aL as teaching that "68.8% of the genes showing over- 
representation in the genome did not show elevated transcript levels." (Page 17 of the Advisory 
Action). 

Appellants respectfully point out that Li et aL acknowledge that their results differed 
from those obtained by Hyman et aL and Pollack et aL (of record), who found a substantially 
higher level of correlation between gene amplification and increased gene expression. The 
authors note that "[t]his discordance may reflect methodologic differences between studies or 
biological differences between breast cancer and lung adenocarcinoma" (page 2629, col. 1). In 
fact, as explained in the Supplemental Information accompanying the Li article, genes were 
considered to be amplified if they had a copy number ratio of at least 1.40 . As discussed in 
Appellants' previous responses, and in the Goddard Declaration of record, an appropriate 
threshold for considering gene amplification to be significant is a copy number of at least 2.0 . 
As discussed above the PR0341 gene showed 2.173 to 2.514 fold amplification in three different 
lung tumors, thus meeting this standard. Thus, the results of Li et aL therefore do not disprove 
that a gene with a substantially higher level of gene amplification, such as PRQ34L would be 
expected to show a corresponding increase in transcript expression, and in fact, support 
Appellants' arguments. 
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Thus, based on the asserted utility for PR0341 in the diagnosis of selected lung 
carcinomas, the reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 
20 in the present application (also see page 305), the disclosure of the step-by-step protocols for 
making chimeric PRO polypeptides, including those wherein the heterologous polypeptide is an 
epitope tag or an Fc region of an immunoglobulin in the specification (at page 374, lines 24 to 
page 375, line 9), the disclosure of a step-by-step protocol for making and expressing PR0341 in 
appropriate host cells (in Examples 140-143 and page 376, line 12), the step-by-step protocol for 
the preparation, isolation and detection of monoclonal, polyclonal and other types of antibodies 
against the PR0341 protein in the specification (at pages 390-395) and the disclosure of the gene 
amplification assay in Example 170, the skilled artisan would know exactly how to make and use 
the claimed polypeptide for the diagnosis of lung carcinoma. Appellants submit that based on 
the detailed information presented in the specification and the advanced state of the art in 
oncology, the skilled artisan would have found such testing routine and not 'undue 5 . 

Therefore, since the instantly claimed invention is supported by either a credible, specific 
and substantial asserted utility or a well-established utility, and since the present specification 
clearly teaches one skilled in the art "how to make and use" the claimed invention without undue 
experimentation, Appellants respectfully request reconsideration and reversal of this outstanding 
rejection to Claims 124-126 and 129-131. 
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CONCLUSION 

The main issue in this Appeal is that the Examiner maintains that the gene amplification 
data has no bearing on the utility of the claimed PR0341 polypeptides, and that for PR0341 
polypeptides to be overexpressed in tumors, amplified genomic DNA would have to correlate 
with increased mRNA levels, which in turn would have to correlate with increased polypeptide 
levels. 

Appellants maintain that patentable utility of the PR0341 polypeptides is based upon the 
gene amplification data for the gene encoding the PR0341 polypeptide. Appellants submitted 
that, in general, a change in DNA levels for a particular gene leads to a corresponding change in 
the level of expression of the encoded protein based on supportive evidence. Since the 
specification clearly discloses that the gene encoding PR0341 showed significant amplification, 
in three different lung primary tumors , and further since Appellants have provided an 
overwhelming amount of evidence in the art, supporting a general DNA/mRNA/protein 
correlation (more than 100 references) in addition to the declarations from experts in the filed 
and references already of record, Appellants believe that utility of the claimed PR0341 
polypeptides has been achieved. Even though, in certain instances, the correlation between 
changes in DNA levels and protein levels is not exact, Appellants emphasize that the utility 
standard is not absolute certainty . Appellants only need to show that it is more likely than not 
that a DNA/mRNA/protein correlation exists in order to meet the utility standard, which they 
have done by providing ample evidence from the art. Appellants further submit that exceptions 
to the correlation between changes in mRNA and changes in protein does not provide a proper 
basis for rejecting Appellants' asserted utility. In fact, considering the evidence as a whole, the 
overwhelming majority of the evidence supports Appellants 5 asserted utility, and therefore, a 
person of skill in the art would conclude that Appellants' asserted utility is " more likely than not 
true ." 

For the reasons discussed above, Appellants submit that present specification clearly 
describes, details and provides a patentable utility for the claimed invention. Moreover, it is 
respectfully submitted that based upon this disclosed patentable utility, the present specification 
clearly teaches "how to use" the presently claimed polypeptide. As such, Appellants respectfully 
request reconsideration and reversal of the outstanding rejection of claims 124-126 and 129-131. 
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8. CLAIMS APPENDIX 

Claims on Appeal 

124. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide of SEQ ED NO: 20; 

(b) the amino acid sequence of the polypeptide of SEQ ID NO: 20, lacking its associated 
signal peptide; 

(c) the amino acid sequence of the polypeptide encoded by the full-length coding sequence 
of the cDNA deposited under ATCC accession number 209792, 

wherein, the nucleic acid encoding said polypeptide is amplified in lung cell carcinomas. 

125. The isolated polypeptide of Claim 124 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO:20. 

126. The isolated polypeptide of Claim 124 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO:20, lacking its associated signal peptide. 

129. The isolated polypeptide of Claim 124 comprising the amino acid sequence of the 
polypeptide encoded by the full-length coding sequence of the cDNA deposited under 
ATCC accession number 209792. 

130. A chimeric polypeptide comprising a polypeptide according to Claim 124 fused to a 
heterologous polypeptide: 

131. The chimeric polypeptide of Claim 130, wherein said heterologous polypeptide is an 
epitope tag or an Fc region of an immunoglobulin. 
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Item 4 was submitted with Appellants* Supplemental Response filed March 30, 2006, and 
noted as considered by the Examiner in the Final Office Action mailed April 21, 2006. 

Item 5 was made of record by the Examiner in the Office Action mailed July 30, 2003. 

Item 6-7 were made of record by the Examiner in the Final Office Action mailed January 
21,2004. 

Items 8-1 1 were submitted with Appellants' Response filed July 7, 2004. 

Item 12 was made of record by the Examiner in the Final Office Action mailed 
September 16, 2004. 

Item 13 was made of record by the Examiner in the Office Action mailed July 30, 2003. 

Items 14-19 were made of record by the Examiner in the Examiner's Answer mailed 
October 12,2005. 

Items 20-151 were submitted with Appellants 1 Information Disclosure Statement filed 
September 25, 2006, and noted as considered by the Examiner in the Advisory Action 
mailed October 25, 2006. 

Items 152 - 160 were made of record by the Examiner in the Advisory Action mailed 
October 25, 2006. 
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10. RELATED PROCEEDINGS APPENDIX 

None- - no decision rendered by a Court or the Board in any related proceedings 
identified above. 
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DECLARATION OF AUDREY P. GODDARD, Ph.D UNDER 37 CF.R. g 1.132 



Assistant Commissioner of Patents 
Washington, D.C. 2023 1 

Sir: 

1, Audrey D. Goddard, PLD. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 




Serial No.: * 
Filed: * 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al, Biotechnology 10:413-417 (1992) (Exhibit B); Livak et al, PCR 
Methods AppL 4:357-362 (1995) (Exhibit C) and Heid et al 9 Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et al, Proc. 
Natl. Acad. Sci. USA . 95(25):14717-14722 (1998) (Exhibit E); Pitti et al, Nature 
396(6712);699-703 (1998) (Exhibit F) andBieche et aL Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et al have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et al studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et al used the assay to study gene amplification in breast cancer. 
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Serial No.: * 
Filed:* 

7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown * 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. \ declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 

Date 



Audrey D. Goddard, Ph.D. 
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# 



AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 
1 DNA Way 

South San Francisco, CA, 94080 

650.225.6429 

goddarda@gene.com 



110 Congo St. 

San Francisco, CA, 94131 

415.841.9154 

415.819.2247 (mobile) 

agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 



Genentech, Inc. 



1993-present 



South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Geneniech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 1 1 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 



1998 -2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 
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1993-1998 



Scientist 



Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89 -12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocytic leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



Ph.D. 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics. 



"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 



1989 



Honours B.Sc 

"The in vitro metabolism of the cytochrome P-448 
inducer p-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



McMaster University, 
Hamilton, Ontario, Canada. 
Department of Biochemistry 



1983 
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ACADEMIC AWARDS 



Imperial Cancer Research Fund Postdoctoral Fellowship 

Medical Research Council Studentship 

NSERC Undergraduate Summer Research Award 

Society of Chemical Industry Merit Award (Hons. Biochem.) 

Dr. Harry Lyman Hooker Scholarship 

J.L.W. Gill Scholarship 

Business and Professional Women's Club Scholarship 
Wyerhauser Foundation Scholarship 



1989-1992 
1983-1988 
1983 



1983 



1981-1983 
1981-1982 
1980-1981 
1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel genes. Functional Genomics: From Genome to Function, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 



Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 . 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting, San 
Francisco, CA ( USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocyte leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 



2000 
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PATENTS 

Goddard A t Godowski PJ, Gurney AL. NL2 Tie ligand homologue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL. NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D, Goddard A, Roy M, Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent:: 
Jun. 25, 2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 19, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,113. Date of Patent: July 8, 1997 

Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ t Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J t Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S., Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 1 35-1 42. 

Lee J. Ho WH. Maruoka M. Corpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM. Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL (2000) Interleukin (IL)-22, a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal . 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 1.12: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 1 1 : 729-735. 
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Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P t 
Brush J, Heldens S, Schow P, Goddard AD, Wood.WI, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E t Kern J A ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD t Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. Sci. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient route to human bispecific IgG. Nature Biotechnology 16(7): 677-681. 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19:15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AD, Dowd P, Chemausek S, Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL, Moffat B, Vandlen R, Simmons L, Gu Q, 
Hongo JA, Devaux B, Poulsen K, Armanini M, Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature. 387(6634): 717-21. 

Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q, Johnson RL, Scott MP, Pennica D, 
Goddard A, Phillips H, Noll M, Hooper JE, de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM, Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(12): 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. Sci. USA 93: 8241-8246. 

Yang M, Luoh'SM, Goddard A, Reilly D, Henzel W and Bass S. (1996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp. 187-21 5. 

Treanor JJS, Goodman L, de Sauvage F, Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GQNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-7113. 

Winslow JW, Moran P, Valverde J, Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, Zeigler FC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. Sci. USA 92: 1866-1870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6:732-737. 
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Goddard AD, CoveNo R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS, Moran P, Gripp J, Armanini M, Phillips HS f Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT f Wang Z, Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 1 0720-1 0728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D f Srivastava A, Antony AC, 
Fioretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocytic leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute 
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transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
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Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL. (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Cell. Genet 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [letter]. 
British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J apd Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
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Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
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Borrow J, Goddard AD, Sheer D and Solomon E. (1990) Molecular analysis of acute 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
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SIMULTANEOUS AMPLIFICATION AM) DETECTION OF 
SPECIFIC DNA SEQUENCES 

Russell Higuchi*, Gavwx Bollinger 1 , P. Sean Walsh and Robert Griffith 

Roche Molecular System*, Inc., M0055rd St., EmeiyvUk, CA 94(308. 'Chiron Corporation, 1400 53rd St, Emeryville, CA 
04fiOB, ^Corresponding author. 



We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can he detected without open- 
ing the reaction tube. This enhancement 
requires die addition of ethidium bromide 
(EtBr) to a PGR* Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (ds) DNA an increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress. The ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 



lthouch the potential bereft* of PCR 1 to clin- 
ical <&grMwtK3 arc well known 2,9 , it i$ suli not 
widely used in this setting, even though it is 
m four year* fiiuco thcrmQ>st*bl* DNA poty" 1 **"* 

ase$ 4 made PCR practical, Seine of the reasons for it* slow 
acceptance are high cost, tack, of automation of pre- and 
post-PCR processing steps, and false positive results, from 
carryovcKOntaminatioD, The first two point* are related 
in that labor is the largest contributor to cost ait the present 
stage of PCR development. Most Current assays require 
some form of "downstream" processing once thermocy* 
ding h done in order in determine whether the target 
DNA sequence was present and has amplified. These 
include DNA hybridiwuon**, gel eJectropboreAis with or 
without use of restriction digestion 7 ;*; HPLCr, or capillary 
electrophoresis 10 . These methods are labor-intense, have 
low throughput, and arc difficult to automate. The third 
point is aLo closely related to downstream processing. 
The handling of the PCR product in these downstream 
processes increases the chances that amplified DNA . will 
spread through the typing lab, resulting in a .risk of 



"carryover" false positives in subsequent testing". 

These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays m which such different processes take 
>lace without, the need to separate reaction components 
iavc been termed ■^mogeitetnis"'. No truly homoge- 
neous PCR assay has been demonstrated to date, although 
progress towards this end has been reported, Chehab, et 
al. 1 * developed a PCR product detection scheme using 
fluorescent primers that resulted in a fiuorcficent PCR 
product AllclMpecifu: primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers must still be 
removed in a downstream process in order to visualize the 
result Recently, Holland, et al. ls , developed an assay in 
which the endogenous 5 r exonudease assay of Taq DNA 
polymerase was exploited to cleave a labeled oligonucko 
tide probe. The probe would only deftve if PCR ampfiu 
cation had produced its coroplemeataiy sequence. In 
order to decect the cleavage products, however, a subse- 
quent process w again needed. 

We have developed a truly homogeneous assay for PCR 
and PCR product detection based upon tbc gready in- 
creased fluorescence that ethidium bromide and other 
DNA binding dyes exhibit when they are bound to ds- 
DNA 1 *- 19 . As outlined in Figure J, a prototypic PCR 
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RQOBE 1 Principle of simultaneous amplification and dctedtOtl of 
PCR product The components of a PCR contsrintn^ EtBr dial arc 
fluores^nT are hswd—EtBr itself, EtBr bound to other ssDN A ot 
dsDNA. There vt a Jar^e Hiiorcsccncc cnhamxmctit when EtBr is 
bound to t>NA and binding fa greatly enhanced when DNA .is 
double-stranded* After sumdent <n> .cydcs of PGR, the.net 
inercaM 'm d.qt>NA renuks in- addkional EtBr bonding, and & net 
increase in total HucKrcsccrice; 
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WCOTf 2 Gel electrophoresis of PCS. amplification products of the 
human, nuclear gene, HLA made in the presence of 

increasing amounts of EtBr (up to 8 M-gftnl). The presence of 
EtBr lias no obvious effect on the yield or specificity of amplifi- 
cation. 
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FIGURE 3 (A) Fluorescence xtrasurcracnulrom PCRs that contain 
0.5 pgfrnl EtBr and that are specific for Y-chrotnosonic repeat 
$eGOencc*, Five replicate P€Rs/*ere begun confining each of the 
DNA* specified. At each indicated cycle, one of the five replicate 
FCfcs for each DNA vras removed from thermocyding and tt$ 
fluorescence measured. Units of fluorescence arc arbitrary. (B) 
UV photography of PGR tube* (0,5 ml Eppcndorf^tylc, pc4yprO~ 
pykne mkro-eentiifuRc tubes) contenting reactions, those ata.tfc. 
ing from Z ng male DNA and control reactions without any DNA, 
from (A). 
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begins with primers that are single-stranded DNA (ss- 
DNA)> dNTPs, and DNA rx>iyznerase; An amount of 
ctaDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micrograms per PCR^ 8 , If EtBr is present, the reagents 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself t and EtBr bound to the single-stranded 
DNA primers and to the double-stranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubk>hch*)» After the first denatu ration cyde, target 
DNA will be largely singie-stranded. After a VCR is 
completed, the most significant change is the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several micrograms- Formerly free EtBr is bound to the 
additional dsDNA, resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDN A 
is much kss than to dsDNA, che effect of this change on 
the total fluorescence of the sample is smalL The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, themocy- 
ciing. 

RESULTS 

PCR in the presence of EtBr. fn order to assess the 
affect of EtBr In FOR, amplifications of the human HLA 
DQct gene 19 were performed with the dye present at 
concentrations from 0,06 to 8.0 ugmil (a typical concen- 
tration of EtBr used in staining of nucleic aads following 
gel electrophoresis is 0.5 p.g/m[). As shown in Figure 2, gel 
electrophoresis revealed litde or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PGR, 

Detection of human Y-chtomcwnHs specific 
oneness* Sequent-specific., fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0,5 jtg/ml EtBr and primers 
specific to repeat DNA sequences found on the human 
Y -chromosome 20 - These PGRs initially contained cither 
60 ng male, 60 ng female, 2 ng roak human or no DNA. 
Five replicate PCRs were begun for each DNA* After 0, 
IV, 21 , 24 and 29 cycles of thermocyding, a FCR for each 
DNA was removed from the thermocyder, and its. fluo- 
rescence measured in a spectroflnorometer and plotted 
vs. amplification cycle number (Fig. 3A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA U 
becoming linear and not exponential with cycle number. 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain* 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNA. The more male DNA 
present to begin with— 60 ng versus 2 ng—rhe fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel dectropHorcsfc on- die products of these 
amplifications showed that DNA fragments of the ex- 
pected size were made in the male. DNA containing 
reactions and that litde DNA synthesis took place in the 
control samples. 

In addition, the increase in fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
transilifuminator and photographing them through a red 
filter. This is shown in figure SB lor the reactions that 
began with 2 ng male DNA and those with no DNA. 

Detection of spedfle alkies of the human p-gtobin 
gene* In order to demonstrate that this approach has 
adequate apedficity to allow genetic screening* a dttcction 
of the sxcklc-cdl anemia mutation was performed- Figure 
4 shows the fluorescence from completed ampMcacio** 

containing EtBr (0.5 ng/ml) a* detected bf photography 

of the reaction tubes on a UV transilluminator. These 
reactions were performed using- primer* specific for ci- 
ther the wiW-tvpe or skkk-ceil mutation of the human 
p^globm gene* The specifkity for each allele i$ imparted 
by placing the sickle-mutation site at the terminal V 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer extension— and thus an> 
pliocation — can take place only if the %* nucleotide of the 
primer is comnleincntary to the g-gJabin allele present* • 
Each pair oi ainpltifications shown in Figure 4 consists of 
a reaction with etcher the wiM-typc allele spedfk (left 
tube) or sfckle-allele specific (right tube) primers. Three 
different DNAs were typed: DNA from a homozygous, 
wflki-type p-globin individual (AA); from a heterozygous 
sickle pMglpbin individual (AS); and from a homozygous 
sickle P-gioW individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed in triplicate (3 pairs 
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0 f reactions each), The DNA .type -was reflected in tbc 
rtktiv*; fluorescence intensities in each pair of competed 
fl ^plificatk>itt. There was a significant increase in fluores- 
ce* only where a &-globin allele DISfA matched the 
primer act. Whco measured oa a spcctroflngronietcr 
Mata not shown), this Buorcsccrjtcc was about three times 
[j*t present in a PGR where both p-globfti alleles were 
^matched to the primer set* Gel ctactropbofesfc (not 
fl hown) established that this increase in fluorescence was 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for p^lobin. There was 
little synthesis of dsDNA in reactions in . which the allele- 
<tpecinc primer was mismatched to both alleles* 

Conrinucw momiovmg of a PGR* Using » fiber optic 
devkcr it i& possible to direct excitation illumination from 
n spectrofl uoromete r to a PGR undergoing thcrmocyding 
and to rctirrn its fluorescence ra the KpcctroftuorotwciCT* 
The fluorescence readout of such an arrangement, di* 
i^cted at an EtBr-containmg ampliftcadoQ of Y-chrorjDo- 
some specifce sequences from 25 ng of toman male DNA+ 
H shown in Figure 5. The readout from a control tCR 
w jih no target DNA is also shown. Thirty cycles of PGR 
w erc monitored for cach- 

THe fluorescence trace as a function of time dearly 
shows the effect of the thennocyding. Fluorescence inten- 
sity rises and . fells inversely with temperature* The fluo- 
rescence intensity is minimum at the denaturadon tem- 
perature (<M°C) and jncwririiufn at the anneaUng/extenMon 
temperature (SOX). In the negaliv&control FCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbcrmOcycJefi, indicating that there is 
little dsDN'A synthesis without the appropriate target 
DNA, and there is little if any bleaching of EtBr during 
the continuous illumination of the sample. 

Jn the PGR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds of therinocycling, and 
continue to increase with time, indicating that dsDNA is 
being produced at a detectable level. Note that the fluo- 
rescence minima at the denaturation temperature do not 
fligiuficantly increase, presumably because at this temper- 
ature there is no cUDNA for EtBr to bind- Thus the course 
of the amplification is followed by tracking the ftuorev. 
cence increase at the annealing temperature. Analysis of 
the products of these two amplifications by gel electropho- 
resis showed a DNA fragment of the expected size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as- hybridization to a se- 
quence-specific probe can enhance the specificity of DNA 
deceiAiuu PGR. The chininatKm of uScac process? 
means that* the specificity of this homogeneous assay 
depends solely on that of PCR. In the case of skUe-celi 
dlita&e, we have shown that PGR alone has sufficient DNA 
sequence apedficky to permit genetic screening. Using 
appropriate amplification conditions, there is liufc nor** 
specific production of dsDNA in the absence of the 
appropriate target alkie. 

The specificity required to detect pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which requires detection 
of a viral genome that can be at the level of a few copies 
per thousands of host cells*. Compared with generic 
screening, which is performed on cells containing at least 
one copy of die target sequence* HIV [detection requires 
both more specificiry and the input of more total 
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Heterozygous 

AS 
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-, ITV photography of PCR tubes containing anmUBcauons 
v&nz EtBr that &t specific to wild-type (A) or sicWte (5> alleles erf 
the human £-globin gene. The left <tf <&ch pair of tubes contains 
aMe-tttedfie primers to the wild-type alleles, the right tube 
primers to the sickle aflete- The photograph was taken after SO 
cycles of PGR, and the input DNAs and the alleles ihcv contain 
are indicated. Fifty tog of DNA was used to begin. PGR. Typing 
was done in triplicate (3 p&ttt of PC&) for each input DNA: 
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20 ng of male DNA 
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RGVKI 5 Continuous real-time monitoring of a PCR. A fiber optic 
was used to carry excitation Jtgnt to a rXJR ha progress and also 
emitted light back to a Btforomctcr (see Experimental PKtocot), 
Amplification UMOg human mado-DNA specific primers in a PCR 
SWrfing with 20 Ag of human nude DNA <top), or in a control 
PGR without DNA. (bottom), were monitored. Thirty cydej of 
PGR w crc foJW^d for each. Tl*e temperature Cycled between 
94*C (deuaturatian) and 50*0 (annealing and extension). Note in 
the male DNA PCR* the cydte (time) <kptrtact)t Riorasc in 
fluorescence at the BOrtcaHttg/exteDMOD temperature. 
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DNA — up to microgram amount»~-in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA m an amplification signi&csmrty increases 
the background fluorescence over which amy additional 
fluorescence produced by PGR must be detected. An 
additional complication that occurs with targets in low 
copy-number is the formation of the **primer-climer n 
artifact. This b the result of the extension of one primer 
using the other primer a* a template, Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PCR amplification, and can compete with 
true PCR targets if those targets are rare, f ne primer- 
dirner product is of course dsDNA and thus is a potential 
source of false signal in this homogeneous a*$ay. 

To increase PCR specificity and reduce the effect of 
prirner-dimcT amplification, we are investigating a num- 
ber of approaches, including the use of nested-primer 
amplifications dial take place in a single tube 8 , and the 
"hot-start**, in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 23 . Preliminary results using these ap- 
proaches suggest tbatpruncr-dimeT is effectively reduced 
and it is possible to detect the increase in Eifir fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of 10 9 ceils. With larger numbers ofccfls, the 
background fluorescence ccrctributed by genomic DNA 
becomes problematic. To. reduce this background, it may 
be possible to use sequence-specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to . 
the oliTOnvdeotidc primer 2 ' 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PGR is completed and continuously during 
ihermocyding. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The Huorescence analysis 
of completed PCRs is already Possible with crating instru- 
mentation in 96-well format* . In tlus format, the fluores- 
cence in each PCR can be quantitated before, after, and 
even at selected points during therraocyciio.g by moving 
the rack of PCRs to a 9^-microwcH plate fluorescence 
reader 26 . 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A diTcct extension of the apparatus used here is to have 
multiple fiberoprics transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure 5 shows that 
the larger the amount of starting target DNA, the sooner 
during Pf.R a fluorescence increase is detected. Prelimi- 
nary experiments <Higiichi and DoUinger, manuscript in 
preparation) with continuous moratoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely* if the number of target molecules is 
known — as it can be in genetic screening— continuous 
monitoring may provide a means of detecting false posi- 
tive and false negative results. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
increases in fluorescence detected before or after that 
cycle would indicate potential artifacts. False negative 
results due to, for eacample, inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles — many more than are necessary co detect a true 
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positive. If a sample fails to have a fluorescence increase 
alter this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event, before any test based on this 
principle is ready for the clinic, an assessment of ttt false 
positive/false negative rates will need to be obtained using 
a large number of known samples. 

In summary, the inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA amplification from outside 
the PGR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the high throughput of 
samples* 

EXPERIMENTAL PROTOCOL 

Raman HLA-DQn gedte ^pKffcadoas containing ft&r. 
PCRs were set up in 100 volumes containing 10 mM Tris^HCh 
pH 8.3; 50 mM RC1; 4 mM MgO z : $5 units of for DNA 
polymerase (Perltm*E)tncr Ccnu, Norwalk* CT); 20 prriole each 
of human HlA-DQa ' gene specific oligonuclcoooe primers 
(*H26 and CH27" and approximately W Copies of DQ* PCR 
product diluted from a previous reaction. Ethidium bromide 
(Et&r; StgtnA) was used at tibe concentrations Indicated in Figure 
2. Thcimocyding proceeded for £0 cycles in a model 480 
thcrmocydcr (Perfcm-Elmer Cctu*, Norwalk, CT) using a "step- 
cyclc" program of 94*C for 1 ram, denaturation and GOT* for w 
sec. annealing and 72°C for 30 sec. extension. 

Y-chEomosomc specific PCR. PCRs (200 fil total reaction 
volume) containing t)JE» itgtaol EtBr were prepared as described 
for HLA-DQfl, except with different primers and target DNAs. 
These PCRs contorted 1 5 pmolc each male DN A**peeific primer* 
Y 1 . 1 and V 1.2*°, and cither 60 rig male, GO ug female, 2 ng male, 
or no human DNA. Tnernttcytling was Wvtor I min- and 60*C 
for 1 tnin using a Vcp-cyde* ptog r a m. The number ofcycfcs for 
a sample were as indicated in Figure 3. Fluorescence measure- 
ment is dascribed below. 

Allek'SpccifiCf human p-gtobin groe PCR* AmpUneaiions of 
100 p4 vojumt vtm% 0.5 jig/tnl of XtBr were prepared as 
described for HLA^DQ<* above except with different primers 2nd 
target DNAs. These PCRs contained euW. primer pair HGP£/ 
H0 1 4A <wfW-type gfobin speculi primers) or HGF2/Hpl4S <«ck- 
lc-giobin specific primers) at 10 pmoJe each primer per PCR, 
Tltese primers were developed by Wu ct aL a \ Three different 
Cacgei DNAa were used in separate amplifxcationsr-50 ng each of 
human DNA that was homozygous for the sickle trait ($S)» DNA 
diat was heteroayrous for the sickle wak (A$X or DNA that was 
homozygous for the w.t- ^Jobin CAA). ThcrmocycBng w^w for ^0 
cycles at 94"C for 1 nun. and 55 D C for 1 min. Using 9 H *tep-cyckf 
program. An anneaHng temperature of 55*0 bad nccn shown vy 
Wu et at 21 to provide allclc^pccinc atnpntotion. Completed 
PCRs vtere photographed through a red filter (Wratiert 23A) 
after placing the reaction tubes atop a model TM-56 transiHunfti- 
nator (UV-orOductS Sah^Gabrkl, CA). 

nnore$<«r»ce measmrexnetzt. FUioreirCeucc measurement were 
cnad^ on PCRs containing EtBr in a Ftuorolog-2 0OoromCter 
(SPEX* Edison, NJ). Excitation was at the 500 nra band with 
About 2 ntn bandwidth with a GO 43S nm cut-off filter jMelles 
Grist Inc.* Irvine. CA) to exclude sccond*order light. Emitted 
light was detected at 5?0 nm wiui a bandwidth of about 7 nm. An 
OG 530 »m cut-off filter was used to remove ihcCTcrtaoon hgjlt 
CondmtooA ftooretcence luv ui to Ui g of PGR, Condniwus 
monitoring of a PCR in progress was accomplished using 01C 
spccu-ofluorometer And •etdnga descrrbed Above as well as a 
ffccTupbc accessory (SPEX cat no. 1950) 10 both send excitation 
fight to. and receive emitted light from, a PCR placed in a well of 
a model 480 me^mpcycler (Fterkm-Eliaer Cetus). The probe end 
- - - ' - * — % fa 



wiui its cap removeo) etteaweiy scaling il inc cxposea top vj 
i,he PCR tube and the end of the fiberoptic cable were shielded 
from room light and the room light* were kept dimmed during 
each run. The monitored pCft was an amplication of Y-cfcro- 
mosdme-spedfk repeat sequences as described above, except 
usingan anncahng/extension temperature of 50°C. The reaCOOti 
was covered yt'tih xnirteral oil (2 drops) to prevent evaporation- 
Tfacrmox^cling and fiuorcsccncc measurement were started si- 
multaneously, A time-base scan with a 10 second kittgratioi) tone 
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v a% unsd and the emta&>n signal was radbed to the excitation 

SnJ to control for change* in light-iourcc intensity. OaC* were 
xied using the dra3090f, version (SPEX) data system. 

We <tani Bob Jones for help with r,he spectrofluormetric 
inftfuurcincnts And Hcalherbdl Fonyfor editing this manuscript 
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IMMUNO BIOLOGICAL LABORATORIES 



sCD-14 EUSA 

Trauma, Shock and Sepsis 




The CD-14 molecule is' expressed on the surface of 
monocytes and some macrophagea. Membrane- 
bound CD-14 is a receptor for I Ipopoty saccharide 
(LPS) complexed to LPS-8inding-Protein (LBP). The 
concentration of its soluble form is altered under 
certain pathological conditions. There, is evidence for 
an Important role of $CD-14.with polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections il seems 
to be a prognostic mariner -and is therefore of vafue in 
monitoring these patients. 



1BL offers an ELISA for quantitative determination of 
soluble CD-14 in human serum, -plasma, cell-culture 
supernatants and other biological fluids. 
Assay features: .12x8 determinations 
(microliter strips), 
precoaied wfth a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3 - 96 ng/ml 
detection limit: 1 ng/ml 
CV: intra- and interassay < $% 

For more information caU or fax 
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SIMULTANEOUS AMPLIFICATION AND DETECTION C 
SPECIFIC DNA SEQUENCES 

Russell Higucbi*, Gavm Bollinger 1 , P. Sean Walsh and Robert Griffith 

Roche Molecular Systems. Inc. 1400 53rd St., Emeryvufe, CA 9400$. »C3»ron Corporation, 1400 53rd Su e^eryvulc, CA 
$4606. + <CoTresponding author. 



We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of ethidimn bromide 
(EtBr) to a PGR. Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (ds) DNA an increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally* In fact, amplification can 
be continuously monitored in order to 
follow its progress. The ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiriag high sample through- 
put 

Although the potential tencfit* of PCR 1 to clin- 
ical diagnostics arc wcU known, 2,9 , it is still not 
widely used in this setting, even though it is 
frmt- year* biuco thermostable I>NA potymi^ 
ase* 4 made PGR practical Some of the reasons for it* slow, 
acceptance are high cost, tact of automation of pre^ and 
post-PCR processing steps, and false positive results, froni 
carryovcT<onumination. The first two points arc related 
in that labor is the largest contributor to cost flit the present 
stage of PGR development. Most current assays require 
some form of "downstream" processing once thei-mocy* 
cHng is done in order to determine whether tlte target 
DNA sequence was present and has amplified. These 
include DNA hybridiwoon 5 *. gel efectrophom is w** 1 or 
without use of restriction digestion*** HFLCr, or capillary 
electrophoresis 10 . These method* arc labor-intense, have 
low throughput, and arc diB&cuh to automate. The third 
point is abo closely related to downstream processing. 
The handling of the PGR product in these downstream 
processes increases the chances that aniplified DNA will 
spread through the typing lab, resulting in a risk of 



"carryover" false positives in subsequent testing". 

These downstream processing steps would be elinti- 
nated if srjectfic amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays m which such different processes take 
place without, the need to separate reaction components 
have been termed ^mogeneous*'. No truly homoge- 
neous PGR assay has been demonstrated, to date, although 
progress towards this end has been reported. Chehab, et 
aL 1 * developed a PGR product detection scheme using 
fluorescent primers that resulted in a fluorescent PGR 
product AUe*c-specific primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers must still be 
removed in a downstream process in order to visualize the 
result Recently, Holland, et at. 13 , developed an assays in 
which the endogenous 5 r oxdnudease assay of Ttq DNA 
polymerase was exploited tt> cleave a labeled oligonucleo- 
tide probe. The probe would only cleave if PGR amplifi- 
cation had produced its complementary sequence. In 
order to detect die ckavage products, however, a subse- 
quent process w again needed. 

Wc have developed a truly homogeneous assay For PGR 
and PGR product detection based upon the greatly in- 
creased fluorescence that ethidium bromide and other 
DNA binding dyes exhibit when they are bound xojis- 
DNA t4 ~ 16 . As outlined in Figure J, a prototype PCR 
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fKOBE 1 Principle of simultaneous amplification and detection of 
PCR product The components of a PCR containing EtBr dial aro 
fluoreseeiw are listed— EtBr itself, Et»r bound to other ssDN-A or" 
dsDN A, There is a large florescence enhancanent when EtBr is 
bound to PNA and binding ia greatly enhanced when DNA is 
douhlc-strandcd, Ater sumcient (n). cydcs of PGR* the net 
increase in d&DNA rciidts in additionai £t&r b&ding, and * net 
increase in total Ruom.sccncE_ 
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rWW 1 Gel cloctrophoresb of PCR ana plificauon products of (he 
human, mtdcar gene, HLA DQa, mad* in the presence of 
increasing amounts of EtBr (up to 8 The presence of 

EtBr has no obvious effect on the yield or specificity of amplifi- 
cation. 





AGGIE & (A) ftttotCSCCnce measurement* from PCRs that contain 
0.5 ng/m] ElBr and that are specific for Y<Jjrotno$ojjac repeat 
»equerice*. Five replicate PCRs were begun containing each of the 
DNA* specified. At each indicated cycle, one of the five replicate 
PCfcs for each DNA -was removed from merxnocyebng and tt$ 
fluorescence measured. Unit* of fluorescence are arbitrary. (B) 
UV photography of PGR tubes (0.5 ml Eppcndorf«tylcv pcHyprc- 
pykne mu^^nirifuec tubes) contenting reactions, those .scatty 
ing from 2 ng male DNA and control reactions Without any DMA, 
from (A), 
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begins with primers that are single-stranded DNA (as- 
DNA), dNTPs, and DNA polymerases An amount cf 
dsDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from stogie-cell amounts of DNA iT to 
micrograms per PCR^ e , If EtBr is present, the reagents 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself, and EtBr bound to the single-stranded 
DNA primers and to the double-stranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doublc-hclut)* After the first denatu ration cyde, target 
DNA will be largely single-stranded. After a PCR is 
completed, the most significant change is the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several mkrograrns. Formerly free EtBr is bound to the 
additional dsDNA, resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDN A 
is much less than to d&DNA, the effect of th» change on 
the total fluorescence of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even comjrjuously during, thermocv- 
cling. ' 

RESULTS 

PCR in the presence of EtBr. U order to assess the 
affect of EtBr lo FOR, amplifications of (he human HI A 
DQct gene 19 were performed with the dye present at 
concentrations from 0,06 to 8.0 u-gfail (a typical concen- 
tration of EtBr used in staining of nucleic aads following 
gel electrophoresis is 0.5 u.g/mO» As shown in Figure 2, gel 
electrophoresis revealed little or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PCR, 

Detection, of human Y-c&tomosonMs specific 
oircnces* Secnierice-ipedfic, fluorescence echanoemexit of 
EtBr as a result of FOR was demonstrated in a scries of 
amplifications containing 0.5 u.g/ml EtBr and primers 
specific to repeat DNA sequences found on the human 
V -chromosome 20 . These PCRs initially contained either 
60 ng male* 60 ng female, 2 ng male human or no DNA. 
Five replicate PCRs were begun for each DNA* After 9, 
17, 21 , 24 and 29 cycles of therittocycling, a PCR for each 
DNA was removed from the thermocyaer t and its fluo- 
rescence measured in a spectrofmoromeier and plotted 
vs. amplification cycle number (Fig. 3 A). The shape of this 
curve rcftccts the fact that by the time an increase in 
fluorescence tan be detected, the increase in DNA is 
becoming linear and not exponential with cycle number; 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNA. The more male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
cycles were receded to give a detectable increase in fluo- 
rescence. Gel cle<tmpnoresi$ on the products of these 
amplifications showed that DNA fragments of the ex- 
pected size were made in the male DNA containing 
reactions and that Utile DN A synthesis took place in the 
control samples. 

In addition, the increase b fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
transilhirninatOT and photographing them through a red 
Alter. This is shown in figure 5B lor the reactions that 
began with 2 ng male DNA and those with no DNA. 

Defection of specific allele* of the human fi-globiu 
gene. In order to demonstrate that this approach has 
adequate specificity to allow genetic screenings a detection 
of the skkfc-cdl anemia mutation was performed. Figure 
4 shows the fluorescence from completed amputations 

containing £tBr (O.S ng/ml) a« detected by photography 

of the reaction tubes on a UV transilluminator. These 
reactions were performed using primers specific for ci- 
ther the. wild-type or sickle-cell mutation of the .human 
P-gtobin gene*\ The specifkity for each allele i$ imparted 
by placing the sickle-mutation site at the terminal V 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer extension — and thus an> 
pli5c3t)on--can take place only if the %' nucleotide of the 
primer is coraplerncntary to the p-globin allele present*' ^ 
Each jpair of ampliations shown in Figure 4 consists of 
a reaction with either the witd-typc allele specific (left 
tube) or skklc-allele specific (right tube) primers. Three 
different DN As were typed: DNA from a homozygous, 
wild-type f^-globin individual (AA); from a heterozygous 
sickle p^glpbin individual (AS); and from a homozygous 
sickle f^Qhm individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) vtas analyzed in tripKcate (3 pairs 
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c f reactions each). The DNA .type vas reflected in the 4 
rc ]atrve fluorescence intensities in each pair of competed 
A(n p]t£catk>ittt There was a significant increase in fluores* 
(ctioc only where a p~globin allele DNA matched the 
printer set. When measured on a spcctroSnoromctcr 
[data not shown)> this fluorescence was about three times 
j^t present in a FOR where both p-globhi alleles were 
jnbinatched to the pHmet set. Gel eicarophare*)* (noi 
phown) established that this increase iri fluorescence was 
due t,o the synthesis of nearly a microgram of a DNA 
fragment of the expected size for 0-globin. There was 
little synthesis of ds£)NA in reactions in which the allele- 
^pecinc primer was mismatched to both alleles. 

Coutimiwifl numitoriDg of a PGR. Using a fiber optk 
device? it U possible to direct excitation illumination from 
,1 spectrofl uoromete r to a PGR undergoing: thcrmocycling 
and to return its fluorescence to the Rpectroftuowmeter. 
The fluorescence readout of such an arrangement, di- 
ttoed at an Etfir-comaining amplification of Y*chrorjnc 
some specific sequences from 25 njj of human male DNA* 
is shown in Figure 5. The readout from a control fCR 
with no target DNA is also shown. Thirty cycles of PCR 
were monitored for each- 

The fluorescence trace as a function of time dearly 
shows the effect of the thermocyding. Fluorescence inten- 
sity rises and. fails inversely with temperature The fluo- 
rescence intensity is niinimum at the denaturalion tem- 
pera tore (94°C) and maadmum at the anneaUn^extension 
temperature (50°C>- In the negative-control PCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty thennocycte, indicating that there is 
little dsDNA synthesis without the appropriate target 
DNA, and there is litde if any bleaching of EtBr during 
the continuous illumination of the sample. 

In the PCR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds of thcrroocyding, and 
continue to increase with time, indicating that dsDNA is 
being produced at a detectable leveL Note that the fluo- 
rescence minima at the denaturation. temperature do not 
flifmi&candy increase, presumably because al thb temper- 
ature there is no dsDNA for EtBr to bind. Thus the course 
of the amplification is followed by tracking the ftuorcs-. 
cenee increase at the aaneaKrig temperature. Analysis of 
the products of these two amplifications by gel ejetftropbo- 
rcpis showed a DNA fragment of the e xp ec t ed sixe for the 
male DNA containing sample and no detectable DNA 
jymhesis for the control sample* 

DISCUSSION 

Downstream processes such as hybridization to a se- 
quence-spedfic probe can enhance die specificity of DNA 
dectiAivu b> PGR. "The cbmu»t»on of (li^ic processes 
means that' the specificity of this homogeneous assay 
depends solely on ihat of tCfL In the case of skkle-celi 
disease, wc have shown that PGR alone has sufficient DNA 
sequence HpedfidLy to permit genetic screening* Using 
appropriate amplification conditions, there is little non* 
specific producdon of dsDNA in the absence of die 
appropriate target allele. 

The spedndty required to detwet pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA chat must be taken with the 
sample. A difficult target is HIV, which TcquiTcs detection 
of a viral genome that can be at the level of a few copies 
per thousands of host cells*. Compared with genetic 
screening, which is performed on ceils containing at least 
one copy of die target sequence, UIV :detection requires 
both more spedndty and the input of more total 
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UV photography of PCR tubes containing Janpt/ficatroiu 
using EtBr that are specific : to wikl-type (A) or sic&te (5) alleles of 
the human 0-globin gene. The left of each pair of tubes contains 
aHele-spedfic primer* to the wild-type afldes, the rigtu tube 
primers to the sfcWe atWe. The phmograph was tafcen after 30 
cycles of PCR, and the input DNAs and the alleles they contain 
arc indicated. Fifty tog of DNA was used to begin PGR, Typmg 
was done in triplicate (3 pair* of PCRs) for each input DNA 
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nGVME 5 Continuous, real-time monitor mg of a PCR. A fiber optk 
was oscd to carry excitation light to a PCJR m progress and also 
emitted light back to a fluorometcr (sec Exoenmcntal Protocol). 
Amplification using human malcDNA specific primers in a PCR 
Starting with 20 ng of human male DNA (top), or in * control 
PCR without DNA (bottom), were, monitored. Thirty cyd« of 
PCR were followed for each. The' temperature cycled between 
94*C (denaturatiori) and 50*C (annealing and extension), Note in 
the male DNA PCR,. the cyde (time) dejrtflrfcot increase in 
fluorescence at the aoneaEn^extension temperature. 
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DNA— lip to microgram amounts— -in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA m an amplification simitittutty increases 
the background fluorescence over which any additional 
fluorescence produced by PGR mu$t be detected. An 
additional complication that occurs with targets in low 
copy-number h the formation of the **primer-dimer" 
artifact. This is the result of the extension of one primer 
using the other primer <t* a template, Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PGR amplification* and can compete wtth 
true PGR targets if those targets are rare, The primer- 
dimer product i$ of course dsDNA and thus is a potential 
source of false signal in this homogeneous a*$ay\ 

To increase FOR specificity and reduce the effect of 
primer-dimcr amplification, we are investigating a num- 
ber of approaches, including the use of nested-primer 
amplifications that take place in a singfc tube 8 , and the 
4 'hot-start*\ in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 25 . Preliminary results usbg these ap- 
proaches suggest tbatprbncr-dimcT is effectively reduced 
and it is possible to detect the increase in EtBr fluores- 
cence in a PGR instigated by a single HIV genome in a 
background of 10* celts. With larger numbers of octts, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To reduce this background, it may 
be possible to use sequence -specific DNA-binding dyes 
that can be made to jpicferenoally bind PGR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the FCR product through a 5' *add-on n to . 
the oligonucleotide primer 44 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containwg JPCR is straightforward, 
both once PGR is completed and continuously during 
ihermocycfing. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The fluorescence analysis 
of completed PGR* is aheadyjpossiblc with existing instru- 
mentation in 96-weJl format? . In tills format, the fluores- 
cence in each PGR can be cj[uantitated before, after, and 
even at selected points during therraocyc«n.g by moving 
the rack of PCRs to a 96-rnicroivcll plate fluorescence 
reader 20 . 

The instrumentation necessary to continuously monitor 
multiple PGRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberoptics transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number- Figure 5 shows that 
the larger the amount of starting target DNA, the sooner 
rlti ring Pf.R a fluorescence increase is detected. Prelimi- 
nary experiments {HigucM and Dollinger, manuscript in 
preparation) with continuous monitoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as it can be in genetic screening— continuous 
monitoring may provide a means of detecting false posi- 
tive and false negative result*. With a knovn number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PGR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts* False negative 
results due to, for example, , inhibition of DNA polymer- 
ase, may be detected by including within each PGR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of Cy- 
cles— many more ' than arc necessary to detect a true 
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positive. If a sample fails to have a fluorescence increase 
after this many- cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event before any test based on this 
principle is ready for the clinic, an assessment of its false 
positiWfalse negadve rates will need to be obtained using 
a large number of known samples. 

In summary, the inclusion in PGR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect speciftc DNA amplification from outside 
the FCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand the High throughput of 
sampler 

EXPERIMENTAL PROTOCOL 
Human HLA-DQ** ^pHflouiotks containing EtBr. 

PGRs were set up in 100 t*l volumes containing 10 mM Tris-HCK 
pH 8.3; 50 mM KC1; 4 mM MgC^: « units of too DNA 
polymerase (P^rkii^EImcr Genu. Norwalk* CT>; 20 pinole each 
of human HtA-DQa ' gene specific oligonucleotide primers 
and CH27 19 and approjoroawl; 10* copies of rjQfc PCK 
product diluted from a previous reaction. Ethidium bromide 
(Ei&r; SigtnA) was used at the concentrations indicated in Figaro 
2. Thennocyding proceeded for 20 cycles in a model 4*0 
uiermocydcr <Perkm«EJmcr Cccua, Norwalfe, CT) using a "stcp- 
cycJc" progrun of 94*Q for 2 ram, denature Lion and wTr for 30 
sec annealing and 72°C for 30 sec. extension* 

V-chrontowmc specific PCR. PCRs (100 ul total reaction 
volume) containing 03 >j£/«ol EtBr were prepared as described 
for HLA»DQct, except widn different primers and target DNAs. 
These PCRs contained ) $ pmolc each male DNA-spcctfic primes 
YI.l and Vl.2 M , and cither 60 ng roale, 60 tig female, 5> ng male, 
or no human DNA. Therniocycling *as W*C Tor I min- and 60?C 
for I min using a "stcp-Cyde* program. The number of cyctes for 
a sample were as indicated in Figure 3. Fluorescence measure- 
ment is described below. 

Allek^pccific, human PCR* AmpUncauons of 

100 vcriumc vsmp; 0 5 >tg/ml of ZtBr were prepared as 
described for HLA-OQa above except with different primers and 
target DNAs. These PCRs contained eiiW. primer pair HGP$/ 
H0HA <wiW-type globin spedae primer*) or HGF2/Hfn4$ (sick- 
lc-gtobin specific primers) at 10 pmole $ach piimcr per FCR. 
These prime rs were developed by Wu ct aL 21 . Three different 
(atgei DNA* werc ti*cd in separate amplificatkrasr— 50 ng eacii of 
human DNA that was homozygous for the sfcfetc trait (5S)» DNA 
that was heterorygous for the stcMe Irak (AS), or DNA that was 
homozygous For the w.t. globin (AA). ThcTmocycnng ww for 20 
cycles at 94T for 1 nun. and 55*C for 1 min. it«n| a * , .-rtcp-cycte n 
program. An annealing temperature of 55^ bad ncen shown by 
Wu ct al, 21 to provide aUcJc-spctinc atflpIiKeation- Completed 
PCRs were phcSngraphcd through a red filter (Wratien 23A) 
after placing the reaction tubes atop a model transSHuWu- 
nator (tiV-productS Sani^bricl, CA). 

FhiOTesecnc* measurement. Ftaortt^KX measurements were 
road* oh PCRs containing EtBr in a Fluorolog-2 fltioromCtcr 
(SFEX* Edison, NJ). Excitation was at the 500 nra band with 
abour 2 ntn bandwidth with a GG 435 nm cut-off ftlnerjMeHes 
Grist Inc., Irvine. CA) to exclude second-order light. Emitted 
light was detected a< 5 70 nm with a bandwidth of about 7 nm* An 
OG 530 nm cut-off filter was used to remove the excitation hght 

Contitatotift Rnoreflccnce inomMtng of FCK, Co»dnuouS 
monitoring of a fCR in progress was accomplished usin^ d3C 
spectronuorometer and setnngH dcscribod above as weU as a 
flWjpac accessory (SPJEX caL no. 1950) to both send excrtauoa 
Rght to, and receive emitted light from, a PCR pitaccd in a well of 
a model 480 thcrnvocycicr (Pcrkm-Eliner Cetus). The probe end 
of the fiberoptic cable was attached with "5 mtoutc-cpoxy"' to tbte 
open top of a PCR tube (a O.o ml poiypronyienc centrifuge tube 
with its cap removed) effectively dealing it. The cxposedTloP <J 
the PGR tuoe and the end of the fiberoptic caWc were shielded 
from room light and the room light* were kept dimmed during 
each ruin. The moniiorcd PCR was an amplification of Y-cbro- 
mosQme-spcclftc repeat sequences as described above, except 
using . an anncating/extension cemperauirc of 50°C. The reaction 
was covered with xnirvend oil (2 drops) to prweitt evaporanon- 
ThemocycHng and* fluorescence rocasurcmcnt were started st- 
multancously, A time-base <»t\ with a 10 second integra£iOi>' t»ne 
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w as urm) and the emission signal was ratioed to the excitation 
jiigrtftl to control for. ch?w#» in jkht-*ourcc intcnirity. ttoc* were 
fleeted using the dntfOOOf, version 15 (SPEX) data system. 
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IMMUNO BIOLOGICAL LABORATORIES 



sCD-14 EUSA 

Trauma, Shock and Sepsis 




The CD-14 molecule is' expressed on the surface of 
moncx;ytes and some macrophages. Membrane- 
bound CD-14 is a receptor for I ipapoly saccharide 
(LPS) completed to LPS-Binding-Pratein (IBP). The 
concentration of te soluble form is aftered under 
certain pathological conditions. There- is evidence for 
an important role of $CD-14.with polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic marker -and is therefore of value in 
monitoring these patients. 



IBL offers an EUSA for quantitative determination of 

soluble CD-14 in human serum, -plasma, cell-culture 

supematants and other biological fluids. 

Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range; 3-96 ng/mi 
detection limit 1 ng/mi 
CV: intra- and tnterassay < 8% 



For more information cad or fax 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

Perkin-EIrner, Applied Biosysteras Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenk probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An Increase 
In reporter fluorescence Intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' -+B' nucle- 
olytic activity of Taq DNA polymerase. 
In this study/ probes with the 
quencher dye attached to an internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 -end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All intact probes 
showed quenching of the reporter 
fluorescence. In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It Is 
proposed that the larger signal Is 
caused by increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 '-end nucleotide also exhibited 
an increase In reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
be used as homogeneous hybridiza- 
tion probes. 



homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al. (1) 
The assay exploits the 5'^> 3' nucle- 
olytic activity of Taq DNA poly- 
merase^ and Is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched If the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET). (4 - 5) During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' 3' nucleolytic 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 

MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
ramethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocking phosphate, 
were obtained from Perkin-Elmer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeled phos- 
phoramidite at the S' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
UN-containing oligonucleotide in 250 



«:357-3€2«i1995 by Cold Spring Harbor Laboratory Press ISSN 1054-9803/95 S5.00 PCR Methods and Applications SS7 

PACE 2/7 » RCVD AT 10/24/2005 5:55:20 PM [Pacific Daylight Time) ■ SVR:SVCS01/0 * DNIS:6034 * CSID:(613) 991-5695 * DURATION (mm-ss):0546 



From (613) 991-5695 



Researchlllim 




Polymerization 



Forward 
Primer 



Probe 



Ipar 



Strand displacement 



5'. 

3'- 



Reverse 
Primer 



Cleavage 



5'« 

3'- 



Ip3< 



Polymerization completed 



0 




— 3' 
■5* 



FIGURE 1 Diagram of 5' nuclease assay. Stepwise represenution of the ^^^vn 
tiviry of Taq DNA polymerase acting on a fluorogenic probe during one extension phase of PGR. 



itim Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreached dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x4.6- 
mm column with 7-jim particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m TEAA (triethylamine acetate). 
Probes are named, by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5' end. 



PCR Systems 

All PCR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-m.I reactions that con- 
tained 10 ibm Tris-HCl (pH 8.3), SO niM 
KC1, 200 pM dATP, 200 dCTP, 200 p.M 
dGTP, 400 \iM dUTP, 0.5 unit of AmpEr- 
ase uracil tf-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Naka)lma-H|ima et al.) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. (B) Actin am- 
plification reactions contained 4 mM 
MgCl 2 , 20 ng of human genomic DNA, 
50 ny Al or A3 probe, and 300 hm each 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C. 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl* 1 ng of plasmid DNA, 50 nw P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (I min), and 
holdat72°C 



Fluorescence Detection 

For each amplification reaction, a 40-u,l 
aliquot of a sample was transferred to an 
individual well of a white, 96-well mlcro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a SIS-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q. value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



TABLE 1 Sequences of Oligonucleotides 



Name 


Type 


F119 


primer 


R119 


primer 


P2 


probe 


P2C 


complement 


PS 


probe 


P5C 


complement 


AFP 


primer 


ARP 


primer 


Al 


probe 


A1C 


complement 


A3 


probe 


A3C 


complement 



Sequence 



ACCXZACAGGAACTGATCACCACTC 

ATGTCGCGTTCCGGCTGACGTTGTGC 

TCGCATTACrGATCGTrGCCAACCAGTp 

GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITTGCTGGTATCTATGACAAGGATjp 

TTCATCCTTGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTG CCCATCTACGA 
CAGCGGAACCGCfCATTGCCAATGG 
ATGCCCTCCCCCATGCCATCCTGCGlp 
AGACGCAGGATGGCATGGGGGAGGGCATAC 

CGCCCTGGACTTCGAGCAAGAGATp 
CCATCTCTTGCTCGAAGTCCAGGGCGAC 



For each oligonucleotide used in this study, the nucleic add sequence is given J™ " * 
y -3' direction. There are three types of oligonucleotides: PCR pnmer, fluorogenic 
t T^s-nudMie assay and complement used to hybridize to the corresponding probe. For the 
"J£Z^S&ZZ^ a position where LAN with TAMRA attached was subs* 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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A1-2 lUQGCCCTCCCCCATGCCATCCPGCGTp 

A1-7 RATGCCCQCCCCCATGCCATCCTGCGTp 

A1 -1 4 RATGCCCTCCCCCAQGCCATCCTGCGTp 

A1 -1 9 RatgccctcccccatgccaQcctgcgtp 

A1-22 RATGCCCTCCCCCATGCCATCCQGCGTp 

A1-26 RATGCCCTCCCCCATGCCATCCTGCGQp 



Probe 


510 


nm 


582 nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ temp. 








A 1-2 


25.5 ±2.1 


32.7 ±1.9 


38.2 ± 3.0 


38.2 ±2.0 


0.67 + 0.01 


0.66 ±0.06 


0.19 ±0.06 


A1-7 


. 53.5 ±4.3 


395.1 ±21.4 


108.5 ±6.3 


110.3 ±5.3 


0.49 ±0.03 


3.58 ±0.17 


3.09 ±0.18 


A1-14 


127.0±4.9 


403.5 ±19.1 


109.7 ±5.3 


93.1 ±6.3 


1.16 ±0.02 


4.34 ±0.15 


3.18 ±0.15 


A1-19 


187.5 ±17.9 


422.7 ±7.7 


70.3 ±7.4 


73.0 ±2.8 


2.67 ±0.05 


5.80 ±0.15 


3.13±0.16 


A1-22 


224.6 ±9.4 


482.2 ±43.6 


100.014.0 


96.2 + 9.6 


£25 ±0.03 


5.02 ±0,11 


2.77 ±0.12 


A1-26 


160.2 ±8.9 


454.1 ±18.4 


93.1 ±5.4 


90.7 ±3.2 


1.72 ±0.02 


5,01 ±0.08 


3.29 ±0.08 



FIGURE 2 Results of 5' nuclease assay comparing p-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PGR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 516 and 582 nm. 
Reported values ate the average ±1 s.D. for six reactions run without added template (no temp.) 
and six reactions run with template (+temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ~ and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to-weil variations in probe concentra- 
tion and fluorescence measurement Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
(RQ*). 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human p-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance In the 5' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This Is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ~ depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ~ values indi- 
cate that probes AM4, Al-19, Al-22, and 
Al-26 probably have reduced quenching 
as compared with Al-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-tenninal Nucleotide 



518 nm 582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 


RQ + 


ARQ 


A3-6 
A3-24 


S4.6 ± 3.2 
72.1 ± 2.9 


84.8 ± 3.7 
236.5 ± 11.1 


116.2 ±6.4 
84.2 ± 4.0 


115.6 ±2.5 
90.2 ± 3.8 


0.47 ± 0.02 
0.86 ± 0.02 


0.73 ± 0.03 
2.62 ±0.05 


0.26 ± 0.04 
1.76 ± 0.05 


P2-7 
P2-27 


82.8 ± 4.4 
113.4 ±6.6 


384.0 ±34.1 
555.4 ± 14.1 


105.1 ± 6.4 
140.7 ± 8.5 


120.4 ± 10.2 
118.7 ± 4.8 


0.79 ± 0.02 
0.81 ± 0.01 


3.19 + 0.16 
4.68 ± 0.10 


2.40 ±0.16 
3.88 ± 0.10 


P5-10 
P5-28 


77.5 ± 6.5 
64.0 ± 5.2 


244.4 ± 15.9 
333.6 ±12.1 


86.7 ± 4.3 
100.6 ± 6.1 


95.8 - 6.7 
94.7 ±6.3 


0.89 ± 0.05 
0.63 ± 0.02 


2.55 ± 0.06 
3.53 ±0.12 


1.66 ± 0.08 
2.89 ±0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 



PCR Methods and Applications 359 

PAGE 4/7 * RCVD AT 10/24/2005 5:55:20 PM [Pacific Daylight Time] * SVR:SVCS01/0 * DN1S:6034 * CSID:(613) 991-5695 * DURATION (mm-SS):0546 



From (613) 991-5695 



ResearchWWMi 




fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at S18 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2 " 1- effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 "" concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
0 mM Mg 2 * is only slightly higher than 
RQ at 10 mM Mg 2 *. For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2 * are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg* + followed by 
a gradual decline as the Mg z+ concen- 
tration increases to 10 mM. Probe Al-14 
shows an intermediate RQ value at 0 mM 
Mg 2 * with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2 ^ ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg 24 " effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION 

The striking finding of this study is that 
it seems the rhodamlne dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nm 



582 nm 



RQ 



Probe 


ss 


ds 


ss 


ds 


ss 


ds 


Al-7 


27.75 


68.53 


61.08 


138.18 


0.45 


0.50 


Al-26 


43.31 ' 


509.38 


53.50 


93.86 


0.81 


5.43 


A3-6 


16.75 


62.88 


39.33 


165.57 


0.43 


0.38 


A3-24 


30.05 


578.64 


67.72 


140.25 


0.45 


3.21 


P2-7 


35.02 


70.13 


54.63 


121.09 


0.64 


0.58 


P2-27 


39.89 


320.47 


65.10 


61.13 


0.61 


5.25 


P5-10 


27.34 


144.85 


61.95 


165.54 


0.44 


0.87 


P5-28 


33.65 


462.29 


72.39 


104.61 


0.46 


4.43 



(ss) Single-stranded. The fluorescence emissions at 518 or 582 nm for solutions containing a final 
concentration of SO rm indicated probe, 10 mM Tris-HCl (pH 8.3), 50 mM KC1, and 10 mM MgCl 2 . 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26, 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 nM 
P5C for probes P5.10 and P5-28. Before the addition of MgCl* 120 pi of each sample was heated 
at 95°C foi 5 min. Following the addition of 80 yd of 25 mM MgCl 2 , each sample was allowed to 
cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 run. If 
TAMRA Is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ " , which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ" include 
the particular reporter and quencher 
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FIGURE 3 Effect of Mg 2+ concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 518 and 582 am was measured for solutions containing 50 nM probe, 10 mM 
Tris-HCl {pH 8.3), 50 mM KG, and varying amounts (O-10 mM) of MgCl 2 . The calculated RQ 
ratios (sis nm intensity divided by 582 nm intensity) are plotted vs. MgCi 2 concentration (mM 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T ra , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe. (l) 

The rise in RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ" values 
Is less clear-cut The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ~ than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ" 
value. For the P5 probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ" value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ". 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and PS 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally, For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the 5' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed foT two probes 
with internally attached TAMRAs. {9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. (1> demonstrated that allele-spedfic 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AF508 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al. (10) describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PGR amplifi- 
cation. 
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Wc have developed a novel "real time" quantitative PCR method. The method meacure* PCR product 
accumulation through a dual-labeled lliioroeenlc probe (Lc TaqMan Prob*). This method provides very 
accurate and reproducible quantitation of gene copies. Unlike other quantitative PCR methods, real-time PCR 
does nor require poa-PCR santple handling, prevendng potential PCR product carry-over contamination and 
resulting In much faster and higher throughput assays. The real-time PCR method has a very large dynamic 
range of starting target molecule determination (at tax five orders of magnitude). Real-time Quantitative 
PCR is extremely accurate and less labor-intensive than current quantitative PCR methods. 



Quantitative nucleic acid sequence Analysis lias 
had an important role in many fields of biologi- 
cal research. Measurement of gtuiv. expression 
(RNA) has btwsn used extensively In monitoring 
biological responses to various slimuii (Tan el al, 
1994; Huang el al. 1995a,b; I'rud'bomme ct al. 
1995). Quantitative gen* analysis (DNA) has 
iH-cn used to d«ltirmine the genome tjuaiiLily of a 
particular gene, as in the case, of the human }I1LA2 
gene, which is amplified in -30%. of breast tu- 
mors (Slarnon et al. 1987). Gene and genome 
quantitation (DNA and UNA) also have been used 
for analysis of human immunodeficiency virus 
(IIJV) burden demonstrating changes in the lev- 
els of virus throughout the different phases of the 
disease (Connor «t al. 1993; i'latak ct al. J9v;»h; 
1-uTtado et al. 1V95). 

Many methods have been described for tin: 
quantitative analysis of nucleic acid sequences 
(both for UNA and DNA; Southern IV Sharp el 
al. 1980; Thomas 1980). Recently, PCR has 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCJR and reverse transcrip- 
tase (K'0-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has made pos- 
sible many experiments that could not hove, been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 
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that \\ be uacU properly for quantitation (U»«y- 
maeKers 1995). Many early reports of quantita- 
tive PCR and RT-PCR described quantitation of 
the PCR product but did not measure the Inltiol 
target sequence quantity. It is essentia] to design 
proper controls for the quantitation of the initial 
target sequences (Perrc 1992; Clementl ct al. 
100?.) 

rU'Nftfirchcxs have developed several methods 
of quantitative PCR and RT-PCR. One approach 
measures PCR product quantity in the lug phase 
of the reaction before the plateau (Kellogg ct al. 
1990; Pang ct a). 1990). This method requires 
thai each sample has equal input amounts of 
nucleic add and that each sample under analysis 
amplifies with identical efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained in all samples at relatively constant quan- 
tities, such as p-aelln) can be used for sample 
amplification efficiency normalization. Using 
conventional methods of Kin detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure 
that all samples are analyzed during the log phase 
of the reaction (for both the target gene and the 
normalization gene), Another method, quantita- 
tive competitive (QC)-HCR, has been developed 
and is used widely for PCR quantitation. QC-1'CR 
relies on the inclusion- of an internal control 
.competitor in each reaction (Becker-Andre 1991; 
Platak et al. 1993« # h). Th« efficiency of each re- 
action is normalised to the internal competitor* 
a ininwn a it ti unit of internal competitor can be 
annrv 7nc« no/, wj «'c:*t 7An7/cn/7T 
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oddod to each sample. To obtain relative quant- 
totlon, ihc unknown target PGR product iswtn. 
pared with the known competitor K:k product. 
Success of a quantitative competitive I'CU assay 
relit*s on acvcloping an Internal control ll ml am- 
plifira Willi the same efficiency us the Uii^el mol- 
ecule. The design of the coinpetltoi and the va- 
cation of amplification efficiencies jcquire a 
dedicated effort. However, because QC-PCK does 
not require that PC '11 pioducls be walyyxxl during 
the lo& phase of I he amplification, it is tin: easier 
of Ihr two methods to use. 

Sever;) I detection systems uie wm:U for quan 
UUtive l-'CR and RT-PCtt analysis; 0) a$«m*so 
ycls, (2) fluorescent labeling of PCR products and 
detection with In.iKT-jndue.cd fluoroAeeilCC uslll£ 
capillary elexrtrophoTesiri (htisco ct a). 1995; Wil- 
liams ct ah 1990) or acrylaiukde gels, urul (3) jihiie 
capture 1 , and sandwich probe hybrid I wium (Mul- 
der el ah 1994). Although these method.* pmvrd 
successful, each method Tequlres post -PCR nia- 
nipularlom That add Tlm« to the analysis and 
tmty lead to htboNituiy * onlrtii'iination. The 
sample throughput of these method* i> limited 
(with I In- i-xtcptlon of the plate capture ap- 
proach), nnd, thurufnre, these methods ore not 
well suited hit u>o demanding high sample 
throughput (I.e., .screening of large numbers of 
ljU>tuwI*Tv.ulc* 01 aiittty/.tit^ .Vimplva fuj diagili^* 
tic* or clinical trial*). 

More wv report the development of a novel 
assay for quantitative DNA analysis. The assay is 
bfl.icd on 1h«: use of the ,S' nuclease assay first 
described by Hollund et ah (1993). The method 
uses ihe. .V tmc.ICA.ic Activity of 7Vj</ polymerase to 
cleave a non extendible hybridization probe dur- 
ing tbc extension phnac of I'CU. The tippronch 
uses diiiil-tnbcled fluorogenic hybridisation 
probes (Lee. el nl. 1 !>93; Jtosslcr ct til. 1995; l.ivolt 
el al. l$96o,h). One fluorescent dye serves as a 
reporter IF AM (i.e., (^Ciirboxyfluoreseeiii)! and i1s 
emission spectra is quenched by the second fluo- 
rescent dye., TAMRA (l.ft. ( u -carboxy-ie tram ethyl - 
rhodamine). The nuclease degradation of the hy- 
hrldlxiitloii probe releases the quenching of I he 
I'AM fluorescent emission, resulting in an In- 
crease In peak fluorescent emission at S3 y run. 
The use Of a sequence dclccior (AUI Prism) allows 
measuicmeni of fluoresce, nt spectra of all 96 wells 
uf rite thermal cycler conrtnuutisly during Hie 
]*C;K amplification. Then: fore, the rcuetious wc 
moultnrvd in real time. The output data is de- 
scribed and quantitative umilysls of input urgct 
UNA sequences L5 discussed below. 



RESULTS 



PCR Produce Derecrlon in R«al Time 

Hie goal vvjis to develop a high-throughput, sen- 
sitive, and neruratc gene quantitation assay for 
use In monitoring lipid mediated therapeutic 
gene delivery. A piasmld unending human factor 
VIII gene sequence, pI'8TM Methods), wax 
used as a model therapeutic «en<:. The assay uso<i 
fluorescent l aqinan methodology and an instru- 
ment capable of measuring fluorescence in real 
time (Alii Prism 7700 Sequence TVlcclor). 'Ilu: 
iaqnmn reaction requires a hybridization pTohe 
lal>cled with two different fluorescent dyes. One 
dye Is a report wr dyv (I»AM), the other is quench- 
ing dye (TAMRA). When the proln: is in lac I, fluo- 
i eaten I energy transfer occurs and the reporter 
dye fluorescent emission is absorbed by the 
quenching dye (TAKfRA). During the extension 
phase of the PCK cycle, the fluorescent hybrid- 
V/Jiihui probe 1^ cleaved by tlie S'-.T nuclcolytic 
activity n( thn DNA polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the quenching dye, re 
mi] til in hi tin increase of the report or dye fluores- 
cent cmta-tlmi ap*etra. PCU primers and probuw 
were designed ft»« tliu iiuman factor VJ1J se- 
quence and human p-actln gene described iii 
Methods), Optimization reactions were per- 
formed to choose the appropriate probe and 
-magnesium concentrations yielding the highest 
Intensity of reporter fluorescent signal without 
sacrificing specificity. The Inst rumen I uses a 
chm-Ki:>couplcd device (i.e., CCD camera) for 
measuring the fluorescent einissi on apeetni from 
. r iOO to Ci$0 nm, liach TCU tiibc was monitored 
sequentially for 2 A rn.s«e whh ei>iitInuous moni- 
toring throughout tlit: ainplifieuthM'i. Uach lube 
wo* rr-exan dried every H»5 see. Conjputcr sofi- 
ware. was dcsigne.d to examine thr fluorescent In- 
tensity of both the rejmrter dye (PAM).and 
the quenching dye (TAMRA). T>ie Muoresccnt 
intensity of the quenching dye, 'I^MHA, chtuiges 
very Utile over the course of the PCR amplifi- 
cation (datH not shown). Therefore, Ihc intensity 
of TAMUA dye c-inlssJon serves *s un Internal 
.tlandurd wltli which to norniullxe the reporter 
dyi: (l : AM) emission vnrioUoits. The .software e*d- 
euloles »i v<)lue termed ^Kn (or ARO) uslnj; the 
following equation: ARn - (Un J ) (Hn , ") ( where 
Kn 4 emission intensity \>t rcporier/emission in- 
tensity of quencher al ony given time In a retu: 
tion lube, and fcu r- emission intensitity of re- 
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poncr/cmlsslon Imvmily of quencher measured 
prior 10 TCU iimplilicalion in Hint same reaction 
tube. J ; or the purpose of quantitation, the Um 
three data points (ARns) collected during the. ex- 
tension step for each I J CK cycle were analyst! 
The nucleolytic degradation of the Hyundai ion 
probe occurs during the extension phase or i'i at, 
and, therefore, reporter fluorescent amnion in- 
creases during thts lime. 'I in: thice data points 
were averaged for each KJK cycle and the nicwn 
value for each was plotted in an "amplification 
plot" shown In J'iflurc 3 A. The AKn mean value is 
plotted on the j^axis, and time, represented by 
cycle number, isplolledon thv*-axis. During the 
early cycles of the PCR amplification, the ARn 



value remains at bas* lino When sufficient hy- 
bridization probe has been cleaved hy the Titq 
]x>lymcraste nuflfcAfic- activity, the inleiisily of ro- 
porter fluorescent emission invreasvt.. Mo.M \Kl)\ 
amplifk-Mjons reach a plateau phone of reporter 
fJuurcHV.nl cmifirion if the reaction 1* carried out 
to high cycle uuiiiK-is. The amplification plot \9 
examined tvily in Uu» reaction, ut a point lhai 
n-prcsents ihf log phase of product arrmnula* 
tion. This is done hy ussigning an arhiUary 
ihreshoJd thai is b«.scd on the variability of the 
base-line dwiu. frt Wgure 1 A, the IhffiShold whs set 
at 10 standard deviations above the mean of 
ba*o Unci emission calculated from lydcn 1 lo 1 5. 
Once the threshold is chosen, the point at which 
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PCR product detection in real time w »•■■*• — — ■ T , . j j a 

fr o,n the extension phase fluorescent erosion data, collected during *. PCR ^c^e ? nd.rd d* 

times the 
genomic 



viation is determined irom the data points collected from the base line of the P'fJ" \\ 

calculated by determining the point al which the fluorescence exceeds a thresho d limi us " a ' y ^ 
s andarcl deviation of the base line). (B) Overlay ot amplification plots of serially (1:2) dll uted ^uman 



. standard 

DNA samples am 



oCrfied with fi-actin primers. (Q Input DNA concentration of the samples plotted versus C T All 
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the amplification plot crotsca the thrctihold'-ivcte 
fined as C,, C r is reported an the cycle number at 
1 1 lift point. Ar will be demonstrated, th« C, value 
Is ptciiicUvt of the quantity of input turgtrt. 

Cj Values Provide a Quantitative Measurement* of 
Input Target Sequences 

Figure 1R shows amplification ploU of 
enl PGR amplifications overlaid, 'Phe amplify 
Hons were performed on a 1:2 serial dthitiora •«*'. 
human genomic JWA. The amplified target v*a* 
human p nctln. The amplification plohi Khifl to 
the right (to higher threshold cycles) na the input 
target quantity is reduced. 'j*hi$ [$ expected ho. 
eauKU nmetloriK with fuwur starting remit w of tlio 
target molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence. An arbitrary threshold of 10 "stan- 
dard deviations above the base line was used to 
determine the O r values. Figure 1C represents the 
C r values plotted versus the sample dilution 
value, Each dilution was amplified in triplicate 
Pf.R amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The C r values decrease linearly with Jncrca.V 
ing target quantity. Thus, C; r values can be used 
.as u quantitative measurement of the input target 
number. It should be noted that the amplifica- 
tion plol for the is.6.ng sample shown In Figure 
IB does not reflect the snmc fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15.6-ng sample also achieves cmdpoini pla- 
teau at a lower fluorescent value than would he 
expected based on the input l?NA. This phennui. 
enon has been observed, occasionally with other 
samples (data not shown) and may be attribut- 
able to late cycle inhibition; this hypothesis is 
still under investigation. It is important to note 
that the flattened slope and early plateau do not 
impact significantly the calculated O, value us 
demonstrated by the fit on Die line shown »n 
Figure 1 C. All triplicate amplifications resulted in 
very similar Cr values— the standard deviation 
did not exceed 0.5 for any dilution, this experi- 
ment contains a > 3 00,000-fold range of Input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than directly 
using total fluorescent omission intensity for 
quantitation. The linear range ol lluoresccnt in- 
tensity measurement of the AIM Prism 7700 Se- 
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rmmts over n very large r;mj;c nf rs*\M\vo starting 
target quantities. 

Sample Preparation Validation 

Several parameters influence the efncli : nry nf 
PC:r amplification: magnesium and salt conceiv 
nations, reaction conditions (i.e., time and ic.m* 
pe.rulurc), PCK target size and composition, 
primer sequences, and sample purity, All of the 
above (actors are common to a single PCR assay, 
except sample to sample purity. In an effort to 
validate the method of sample preparation for 
the iactor VJJ1 assay, PCK amplification reproduo 
ihility and oJflcicncy ol JO replicate sample 
prt'iiaratiOTis were examined. After genomic DHA 
was prepared from the TO replicate samples, the 
DNA was qunnUialcd by ultraviolet spectroscopy. 
Amplifications were performed analyzing (S-aciln 
Kcth: content in 100 and 25 iir of total xunoinic 
DNA. liach PCK amplification was performed in 
triplicate. Coinpiiiison of C r values for each trip, 
licate sample show minimal variation based on 
standard deviation and coefficient of variance 
(Table 1). *lt>ercforc, each ol the triplicate PCK 
amplifications was highly reproducible, demon- 
strating that real time PCK using this instrumen- 
tation introduces minimal variation Into the 
quantitative. PCK analysis. Comparison of tin: 
mean C n values of the 10 replicate sample prepa- 
rations alsu showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for p-nclin gene quantity. The highest C T 
difference between any of the samples was Q.$$ 
and 0.73 for the 100 and 25 ng samples, respec- 
tively. Additionally, the amplification of each 
sample, exhibited an equivalent rate of fluorcv* 
cent emission intensity change per amount of 
DNA target analysed as indicated by similar 
slopes derived from the sample dilutions (Pig. 2). 
Any sample contaixiing an excess of a PCX inhibi- 
tor would exhibit a greater measured 3-actin G r 
value for a given quantity of UNA. In addition, 
the inhibitor would be diluted along with tht! 
sample in the dilution analysis (lig, Z) t altering 
the expected c; ( . vaiuc change. F.ach sample am- 
plification yielded a similar result in the analysis, 
demonstrating that this method of sample prepa- 
ration is highly reproducible with regard to 
sample purity. 

Quantitative Analysis of a Plasm id After 

jncfl new rttr w-r rc:&t 7nn7/cn/7T 
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Table 1. Reproducibility of &*mplo Preparation Method 



Samplo 



no. 



3 
4 
5 
6 

; 



10 



Mean 



100 ng 



standard 
m^an deviation 



CV 



18.2-1 

18.23 

10.33 

18.33 

18.35 

1A;44 

1B.3 

U.3 

18.42 

18.15 

18.23 

18.32 

T8.4 

18.38 

18.46 

18,54 

18,67 

19 

18.2B 

18.36 

1S.52 

18.45 

1S.7 

18.73 

18.18 

18.34 

18.26 

18.42. 

18.57 

l 8.66 

(1 10) 



13.27 
l&.V 
18.34 



18.55 
1S.<12 



0.06 
0.06 
0.07 



18.23 0.08 



1BM2 0.04 



18.7-1 0.24 



18.39 0.12 



18.63 0.16 



18.29 0.1 



0.12 
0.17 



0.32 
O.V 
.0.36 

0.46 
0.23 
1.26 
0.66 
0.S3 

0.6S 
0,90 



25 ng 



20.46 

20.SS 

20,5 

20.61 

20.59 

20.41 

20,54 

20.6 

20.49 

20.48 

20,44 

20.38 

20.68 

20.87 

20,63 

21.09 

21,04 

21.04 

20.67 

20,73 

20.65 

20,98 

20.84 

20.75 

20,46 

20.54 

20.48 

20.79 

20.78 

20.62 



standard 
mean deviation CV 



20*51 0.03 0.17 

?0.S4 0,11 0.54 

20.54 0.06 0,28 

20.4 3 0.05 0.26 

20.73 0.13 0.61 

21.06 0.03 0.15 

20. 6 ft 0.04 0.2 

20.86 0,12 0.57 

20.51 0.07 0.32 

20.73 0,1 0.16 

20.66 0.19 0.94 



tor containing a partial cDNA for human factor 
vu l, pl-'gTM. A scries of transections was set 
up ustng a decreasing amount of the plasmid v (40, 
A, U.5, and 0.1 ^g). Tvv rimy- Tour hours post- 
tr« i Mfeci ion, total DNA wnn purified from each 
flask uf crib. p-Aclin gene quantity vya^ ehtocn 
a value for normalisation of genomic DNA con- 
centration from cavil sample. In litis cxpeur jjviit, 
|i-actin gene content should remain constant 
relative to total genomic DNA. Figure 3 shows the 
result of the p-actln ON A measurement (100 jig 
total DNA determined by ultraviolet spectros- 
copy) of each sample. Kach sample was analyzed 
in triplicate and the mean p-actin Cj- values of 
the triplicates were plotted (error bars represent 
r^-<**i#i«fn Hwiaiinm *l h*» htnhpsr niffrrrnrr 

O 7 A fvh 



betwywn any iwo s&mplct moans was 0.9S C r Ten 
nanograms of total DNA of each sample were also 
examined for (VacUn. llic results again slmwcd 
that very a! i nilar amounts of genomic 1>NA were 
present; the maximum mean fi actio <"; ( value 
difference wa.s 1.0. As Insure 3 shows, the rale of 
p-actln C,. change lx.Lwocn the 100 and 10-ng 
sample* was similar (sJoj>c values rang« bwtwoon 
3.56 and -3.45). This verifies again that this 
method of .sample preparation yields sajnpl«s of 
identical PCR integrity (i.O-, no sample contained 
an excessive amuuul uf a VCR Inhibitor). How- 
ever, these results indicate that each sample con- 
tained slight diffcienc.es in the actual amount of 
genomic 1>NA analyzed. Determination of actual 
uuuoiiiic *>NA umccntraiion was accomplished 
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Figure 2 So in pie preparation punty, The replicate 
samples shown In Table 1 wore also- amplified In 
tripicate vising 2S ng of each DMA sample. The fig* 
uife shows die input DNA concentration (100 and 
25 ng) VS. C, In \hv lirjnrp, ihp 100 and 75 ng 
poloU for each sample are connected by a line. 



by plotting the mean (i-actin O, value obtained 
for i*at:h 100n g sample v»n a fUaciln si ami art! 
i-urve (shown in J'ig- 40). The actual genomic 
1">NA coiicputrnil"" c»/ each suinpl«, was ob 
lalncd by extrapolation to thu* x-axls,. 

Figure 4 A shows the measured ^{ • tt, r noil* 
normal1?«e<l) quantise* of factor VI) I plnrtirud 
DNA (pPSTM) from each of tlu; four transient cell 
lni»»>faclion.v Each reaction contained J 00 ng of 
lotol sample 13 N A (as determined by UV spectros- 
copy). V&ch sample w;js ui ial yzed in triplicate 



23 



20 



V 27.73.* ^Irt/rH- 1 



ofigTM trenslooto d 

■» ~4.0 ftp 
■ 0*9 JtO 
A 0.1 mo 



0.B 



f— 
1.2 



m i-a i.e 

log (ng Input ONA) 

Figure 3 Analy*b of li defected cdl DNA quantity 
and purity. I he DNA preparations of the four 293 
cell transections (40, 4, 0.5, and 0.1 u.g of pF8TM) 
were analyzed for the (5-actln gent*. 700 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM that was transferred, the |5-actln 
C T values are plotted versus the total Input DNA 



PC :u amplifications. As shown, pl'STM purified 
jfioic Jbc 20:* cells decreases (moan C, values in- 
ervwi't; with decreasing amounts of pi asm Id 
ttrumU'ttcd. The mean C A values obtained for 
pWFM *in Tlyurc 4A were plotted on a standard 
curve comprised uf scilijlly diluted pKHTM, 
shown .in 'figure 4R. The quantity ui phKTM, \>, 
found in each <if the four transections was do- 
tcrmined by extrapolation to the x ax It of the 
standard curve In Pigurc 4H. 'Hutsc uncorrected 
values, h, for pWTJ'M were normaliyAid \c> deter- 
mine the actual amount of pP81M fuuml per 100 
rig of genomic DNA by using I he equation:. 

t> x 10 0 ng uclual pFBTM copies i>er 
" T 100 ng uf genomic DNA 

whore a actual genomic DNA in u sample and 
b «- pFHTM copies from the standard curve. 'Die 
normal ir-ed quantity of ptOTM per 100 ng of ge- 
nomic DNA for each of the four Irans'fccilons Is 
shown in Hgure 4JJ. 'nicsc results a how ilicu ihc 
quantity of factor Vlll plasiuid associated wiih 
the celln, 24 hr after iruMsfvciiuii, di:i.ii:.ise;. 
wltti dcCTC'cislUK pJti^iuiiJ unu.i'.nuaiiou used in 
the iransfctTtion. Tlu: quantity of pl'tt'J M nwocJ- 
atcu wun 293 celb, after trunsfecilon with 40 ixg 
t>f piusmid, was 35 pg per 100 ng genomic ONA. 
Tills results in -520 plasuiid copies per cell. 



DISCUSSION 

We have described a new method for quanlitiH- 
iufc gene copy numbers using fcaMlmc analysts 
of PLK amplifications. Real-time PCK is compat- 
ible with cJthcT of the two PCK (KT-PCR) ap- 

pruache*: (1) quantitative con»[K-.liiivi; where an 
inlenial cum pelt tor for each target sequence i» 
used for nojTualixflUon (data not shown) or (2) 
qu a niitailve comparative PCK using n uuiiualiza- 
tton ^jene contained within the sample (i.e., [3-nc- 
tJii) ox a "housekeeping" gene for UT-PCK. If 
equal amounts of nucleic ucicl are analyzed for 
cacti sample and if the amplification efficiency 
before quantitative analysis ^ identical for each 
sample, \t)t iTirernal cuntuil (nwjmali^iliou jjene 
or competiuir) should Rive equal M^ndls for aIJ 
samples. 

Tlie real-time PCK method offers several ad- 
vantages over the other two methods currently 
employed (sec tbc Introduction). I ; irst, the. real- 
time I'CR method is performed in a doscd-tubc 
system and requires no post-PCR manipulation 
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Figure 4 Quantitative- analytic of pFSTM in transfccted cells. (A) Amount of 
plasmid DNA used for the irunsfection plotted against the it H-un C, value deier- 
iorprSTM remaining ?a hr alter tronsfotcuon. <0,O Standard curves of 
pf-RTM *nd p-actfn, respectively. pfBTM DNA (fl) and genomic DNA (Q were 
diluted fiArlally 1 ;5 before Amplification with the approprinlc primers. The p-actin 
standard curve wa* usod to normalise the results of A to 1 00 ncj of genomic DNA. 
<C) The amount of P F8TM present per 100 ng of genomic DNA. 



nf sample. Therefore, I In* potential for TCK con- 
tamination in the laboratory is reduced because 
amplified products can be amdyy.ed and disposed 
of without opening the reaction lubes. Second, 
this method suppoiU the umi of a iioruiiilixiitlon 
«enc (Lc, (3-actin) for quantitative. PCR or house- 
keeping genes for quantitative RT-l'Clt controls. 
Analysis Js perform e-U in real time during the Jog 
phase of product accumulation. Analysis during 
k»K phase permits many different genes (over a 
wide input target range) to be analy/cd simulta- 
neously, without concern of reaching reaction 
plateau at different cycles This will make niulli- 
gene. analysis assays much ca.Me.i lu develop, be- 
cause individual internal Mutipetltui* will not he 
necded for coch gene under analysis. Third, 
adinpic throughput will inweasc dramatically 
with the new method because there is no post. 
TCK pinccising time. Additionally, winking hi a 
96-well format is highly compatible wtth auto- 
mation technology, 

The real-time PCR method ii> highly rcprn. 
dudblc. Replicate amplifications can be analyzed 



for c-acli sample minimising ]>otenU«i wmr. The. 
system allows i'ch a very large assay dynamic 
run fte (approaching 1,000,000- fold starting tin- 
gel), tiering u Mandord curve for the target oi in* 
XervM, relative copy number values can be deter- 
mined for any uiikjiuwu *uwph\ Fluorescent 
threshold value*, O r , coneJate linearly with rela- 
tive DNA copy number.?. Ileal time quantitative 
HT-'1*C:H methodology (O'lbson ct ill., this Issue) 
ha* afao bee:n developed. Finally, real lime quan- 
titative PCli methodology can be used lu dcvclup 
high-throughput acre en In g aaaay.H for n variety of 
applications [quantitative gene c*pns>&iuii (RT- 
PCft)i gene copy aaaaya (Itcrfc, I11V, etc.), gcnn- 
typing (knockout mouse analysis), and Jmmuno- 

rcuj. 

Real-time Pf!R may al.w be jTerformed using 
intercalating dyo.s (Higuchi cl al. such as 

eihidium bromide. The fluorogenie probe 
method offers a major advantage over inter- 
calating dyes- -greater specificity (i.e., primer 
dimvrs and nonspecific PCR products are not de- 
tected). 
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METHODS 

Generation of <i PUsmld Containing & Partial 
cDN A Tor Human Factor Vlll 

Total RNA v*« harvcMcd (UN A*ol U (Km* '! <•> Test, inc., 
rr^ndswood, TX) from w\\> ii-%iwfecled wlih a factur VI U 
eKjHtasiuii vector, pC:iS2.lk?.SU (tottm el id/ IUHG; Cor. 
mnn ct al. 1900). A factor VII! partial ellNA wpimiv WOS 
^t iKimUd by irj* K:U KloiicAmp YJ. (Tth ItNA IT.U Kit 

(pan NHOR-cn /y, i't Applied hiosystcms i : o*wi r/»ty, t;A)j 

uslnfc the l*t:u primers KKfor 1-flrcv (primer wcruenre* 
arc shown below), Tin- am oilcan was fcaniplified ttslnR 
modified rtifor and Wrcv primers (nn|XMKled wall tor/IH 
and 1/wdUI restriction site sequences »» the v wwlj and 
clonal into jXiHM- 3Z (Promt^u tlorp., Muiluan, WI). The 
resulting clone, jiPfflH was u«d lor transient iransfccilon 
of cells. 

Amplification of Target DNA anil Dclccilon of 
Amplicon Factor VIII Plasmid DNA 

(pFHTM) was tiinpllfUnl with the i»niu-ii PHfor S'-CX.'CN 

<rrci<;(^\AGAu:iUAt;iii«TC-3' and wrev s'-AAAccrr- 
luocxrrocjA'Jxitj'rACiCi-a'.HicivriciUiii pivduved n -122- 
np i*v;k product. The forwu rtl primer mw de>h;iied tu kv 
ognl/.i; u unique M'i|iirHu* fuuiid Jn (he !>' untranslated 
region of Oil* paitrnt pCU>2.tHZ5)> pldMiiid <int] therefore 
dues not ivU"H iiLau mid amplify the huuiOlv factor Vlll 
gene. I'rimnrft woro chosen with thy avsivUHitY* of I he coin. 
I>uler program Olij;o 4* n (Nutiinml UUwicienees, lnv<, Mly* 
mouth, MN). The human p-actt« gem- whs amplified with 
the primers f}- t ntin forward primer S'TCACOCACACrrciT 
GCCCATO*AGGA-.V and P-actin reverse p»imer V-( !A( 

CGGAACCX;f:ix:ATr(><:(^A"j"GG-3'. The reaction pro- 
oviceo ,i 2y5-r>p vCM product. 

Amplification reactions (SO (xJ) contained a DNA 
sample, )0X l'C;it Huffe.r 1! (6 yJ), 200 n-M dATP, dGTP, 
dGTP, and 400 h-m riUTP, 4 inx< MgCl ? , Units Amph 
7'wt; DNA polymciwc, 0,5 unit AriipKrnsc uracil N-Rly- 
awylunc (UNO), SO pinole of cAeh faciei VIII prlinvi, unci 15 
pi in ilt* of uttt'.h |l Actln ptlmw. Tho kwIKhi*, tik<» «:onlo(ncd 
OIK Of the foMo w 'i»^ tUM^Ctlnn prnhos (WMJ nu curli): 

j'Hj>r«*hc A , (i'AW)Ac:friYrrc:cut:c:T<;frn , (rrrrt:rc: , r- 

GCCTT(TAMRA)v 3' «ud p-ttt-tijt probe 5 r tTAM)AT(Jf.x;c;- 
XCrAMUAJCCCC'CATGCCATCp-.T where p indicates 
phnAphnryl aM<w find X IndtcotcA a linker arm nucleotide. 
Reaction Mh\* wrn- Mit:n»Amp Opt ion 1 Tu1h s s (part i\um- 
WrNkOI 09.1.1, PerlcJn Ulniur) tiiat wore /rutted (nt IVrhln 
Timer) to prcvciil l»yl»l from /c fleeting. Tube caps were 
ilmiJ.iv in Micro A nip Unps hut ipccinlly designed to pre- 
vent Hfilit sewtlepng. AlUil H<<- Vi'M LUm/iwrniilvIv* were ttu>*- 
|/liv:d l>y Pli Applied ltjoftyitcm* (l ! t»*lor f!hy, CA) except 
thr fuel or Vlll priuien, wlik-h weu* .ty ntliesl/ed at Cenvn 
lech, Inc. (South f>u»' Pranclsco, CA). Prohev w^ri* Jc*si^ji(.*<l 
using the Ojiyv 4.0 5ofrwarc, follvwlM^ guidelines m\y>- 
i^CMeci m tnc Motk'l 7700 .sequence l>vuru>r lu.iiituiit*Ml 
manual, briefly, prube 'l' m ^))ih»U1 he ai Jeaat 5 U C hl r t^ r »' 
man The" aruu*ulliix ieai|^*j<iUirn uaciI during ihi-rmul ey- 
rhng; primers sli»\ilil not Xvnn NUhle duplexed wilh the 
probe. 

The iherujfll ryrllng eundiOoas Inrludvd 2 jiiIii tit 
Sirf: and 10 niin at 95"C. Thej-mal tycliilJU proceeded vvllh 



rcactioiiK were performed in the Model 77(H) Sequence IV- 
|«ior (PU Applied Uluxysumiv), uihlrh cuntxhis n Gw- 
Amp K'lU Syrtwm P<rOO. Uc:a*:Uon ctnidilion< vy. rf jnn. 
^nitlilltcU ui* .i I'wwvr MacinttMlt V10XJ (Apple Com pi ih>r ( 
Santa Clora, C^\) linked dirvtily to the Model '/?00 
c)uqikv IXtltfCtor. Ajiaty»U *tf data also pcKor inert on 
the MNi-lnU«h computer, f Vtllnrtlcm and analyRU tofiwaro 
wu» devv|o|wl hi 1*K Appllwi Wosyxitmis. 



Tranjfection of Cells with Factor Vlll Coiulrucl 

Pour T17.S Hask* of 293 cells {ATCX: OKI. ]S7'.t), n human 
fetol kidney st^pen^on cell line, were h"'**" to f>n% con- 
llueney And tftiufccttd ptWI'M. Cells were Krown In the* 
following media: 50% HAM'S HI 2 without GMT, Ji>% Ioim 
glucose JXiJlKtxN^ modified Ka^le medium (DMKM) with* 
enn Rtyoinu wiUi sodium bicaTbnnate, 10% ictal Uwinc 
scrum, 2 imm k^luUinint, And 1% peniollin-slrcptomy- 
tin. The medio was clwngcd 30 mln brhrv tl, r tronsfee 
lion. plOTM t)NA amounts of 40, 4, 0„S ,ind 0.1 ^f? were 
iiditcd it> ml of ft solution containing 0.125 m Cu>a?; 
And 1 x IllU'tiS. The four mixtures were left at roon tt»in- 
|.K'.ntt«ri' feu 10 mln and then uddvtl dmpwlM- to the cells. 
Tlrv n«»k> wviehiv.uUled at 37°C and 5,% i'X\ for 24 hr, 
washed with PUS, i»»tJ Suspended In PUS. The tvhux 
jA-ndi^l cells were divided into » liquid a und DNA wftd cv- 
tmded Innnedlulely usinK the QIAaiup W(n H \ Kit (QUpen. 
Qwt9W>rtIi ( l>NA wu.s e.luled Into 200 P l ul SO i"M 
Trb-lirjut pllH.O, 
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Contributed by David Botstein and Arnold J. Levine, October 21, 1998 

ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyiihe repressible promoter, and (it) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24 j. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WSP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell'membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-30 (GSK-30) resulting in an increase in 
0-catenin levels. Stabilized j3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and- binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
0-catenin levels (9). APC is phosphorylated by GSK-3J3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3> a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3, The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
. Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 u.g of poly(A) v RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 u,g 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further, analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
. coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISPS were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeIed sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
. and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2 (ActJ where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The If/S/'-specific signal was 
normalized to that of the gIyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-I and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 



mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match; To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2; were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. I A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-I were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-I were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ~40,000 {M r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 7A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ~27,000 (M r 27 K) (Fig. IB), Mouse and human 
WISP-2 are 73% identical. Human WlSP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP-1 arid WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 n-g) w *s subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP-1 -specific probe 
(amino acids 278-300) or a 190-bp WISP- 2- specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human )3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354 T aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
' human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 14). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-j3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor' (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated giycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig, 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 





Fig. 4. (A, C, E, and C) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D, The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and £), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (£ and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor ceils are 
negative (G and H). 



the predominant ceil type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression ofWISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jig) 
digested with £coRI {WISP-1) or Xba\ (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal. 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISP-3 ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., j3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through j3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is- also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a s $i serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
. tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP- 1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and 0-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 
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methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and dilated into RPM1 growth medium. T-cell-proliferation assays were 
done essentially as described 2011 . Briefly, after antigen pulsing OOp-griil"' 
TTCF) with tetrapeptides (l-amgmr 1 ), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glutaraldehyde. Glycine was added 
to a final concentration of 0.1M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with I u,C\ of 3 H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 u.g TTCF with 0.25 u,g 
pig kidney legumain in 500 \x\ 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C 
Glycopeptide digestions. The peptides HIDNEEDI, HlDN(N-glucosamine) 
EEDI and H1DNESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5mg reduced, carboxy- methylated human transferrin followed by 
concanavalin A chromatography". Glycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The lyophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mU ml" 1 pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgml" 1 ct- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568,13 mass units. 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells'. Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy-terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected. with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6 * 7 , Apo3L/TWEAK 8,9 , or OPGL/TRANCE/ 
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I RANKL 10 " 12 (data not shown). DcR3-Fc immuno precipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1, Gel- filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity {K A — 0.8 ± 0.2 and 
l.l±0.1nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble- FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
~-0.1 u-gmP 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (A1CD) of mature T lymphocytes, a FasL- 
dependent process'. Consistent with previous results 13 , activation 
of interleukin -2 -stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes U4 ~ Ih . Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from -65% to 
— 30%, with half-maximal inhibition at — l^gml" 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane -associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD l -4), and the AMinked glycosylation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer cefl lines. PBL, peripheral blood 
lymphocyte. 
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Figure 2 Interaction of DcR3 with FasL. a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area}, TNFRi-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference {P < 0.001 ) between the 
binding of DcR3-Fc to ceils transfected with FasL or pRK5. PE. phycoerythrin- 
labelied cells, b, 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1. DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFRi-Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Flag-tagged sFasL was 
incubated with DcR3 r Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR)' 8 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (P&Ls) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3 -based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 



a c 




Time (h) Inhibitor (ug mM) 



Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasL;. 5ng mt" 1 ) oligomerized 
with anti-Flag antibody (0.1 p-gmr') in the presence of the proposed inhibitors 
DcR3-Fc, Fas-Fc or human IgGt arid assayed for apoptosis (mean s s.e.m. of 
triplicates), b. Jurkat cells were incubated with sFasL-Flag.plus anti-Flag antibody 
as in a, in presence of 1 \iQ ml"' DcR3-Fc (rilled circles), Fas-Fc (open circles) or 
human IgGi (triangles), and apoptosis was determined at the indicated time 
points, c, Peripheral blood T cells were stimulated with PHA and interieukin-2, 
followed by control (white bars) or anti-CD3 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgGi. Fas-Fc, or DcR3-Fc (lO^gm)* 1 ). 
After 16 h, apoptosis of CD4* cells was determined (mean ± s.e.m. of results from 
five donors), d, Peripheral blood natural killer cells were incubated with 5, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (filled circles), Fas-Fc (open 
circles) or human IgGi (triangles), and target-cell death was determined by. 
release of 5, Cr (mean ± s.d. for two donors, each in triplicate). 
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we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM2 18xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3- linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2 ' 19 . . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 



a b d 




Figure 4 Genomic amplification of DcR3 in tumours, a, Lung cancers, comprising 
eight adenocarcinomas (c. d, f. g. h, j, k, r), seven squamous-cell carcinomas (a, e, 
m, n. o, p, q), one non-small-cell carcinoma (b), one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (I). The data are means z s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate., c, In situ hybridization 
analysis of DcR3 mRNA expression in a squamous-celi carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d, Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward, Rev and Fwd), the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's f-test 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin- 1 has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L 22 . Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR- family member is 
OPG 3 , which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. □ 



Methods 

Isolation of DcR3 cDNA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-tusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u.g), together with pRK5 encoding CrmA 
(2u.g) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-pc or TNFRl-Fc and then with phycoerythrin-conjugated 
streptavidin (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcR3 recognized some other factor that is 
expressed constitutively on 293 cells. 

Immunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with ( 35 S] cysteine and ( 35 S| methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-fmk (10 ^.M), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5u,g) t followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified. Flag-tagged soluble FasL ( 1 u.g) (Alexis) was incubated 
with each Fc-fusion protein ( 1 u.g), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25 fig) was 
incubated with buffer or with DcR3-Fc (40 u.g) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 u.1 aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3 + lymphocytes were isolated, from peripheral: blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u,g ml" 1 ) for 24 h, and cultured 
in the presence of interleukin-2 ( 100 U ml -1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin-V-binding of CD4 + cells" 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 51 Cr-Ioaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target -cell death was determined by release of ^Cr in effector- target co- 
cultures relative to release of 5, Cr by detergent lysis of equal numbers of Jurkat 
cells. ' 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PGR" 
using a TaqMan instrument (ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER-2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to -~500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5'-(FAM-ACACGATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 (4CT) , where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous E. colt 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated with antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains 1 . In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease ofS. typhimurium and E co/i u " 8 is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for. both subunits to be present for 
activity 8 , and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded (3- 
sheet ( p3 and p8-p 12) spans both arms of the L, with a domain of a 
a- plus p-type structure (f$l, p2, p4-p7, al and a2) on one side 
(within arm I) and a domain of mostly a-helices (ct3-a9) on the 



arm l 




Figure 1 Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A, comparable to that of membrane. a-Helices 
are shown in orange and rj-sheets in green, b, View along the two-fotd axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The p-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom of arm I, as shown in a, towards arm I!, showing the ATP-binding 
pocket, a-c. The protein and the bound ATP are in 'ribbon' and 'ball-and-stick* 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C, C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erfaB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. J. 
Cancer 78:661-666, 1998. 
€> 1998 Wiley-Liss, Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kailioniemi et ai, 1 994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ ( 1 1 q 1 3), and erb&2 ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbB2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Berns et ai. 1992; 
Schuuring et a/., '1992; Slamon et ai, 1987). Muss et al, (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al. (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et ai, 1996; Heid et 
al., 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et al. (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et ai, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (Le., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al., 1994): (i) the 
probe-based homogeneous assay provides a real -time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why C, is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C, 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr. 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndJ and er&B2), as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 1 08 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 108 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C, (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C t and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3. in which no genetic alterations have been found in 
breast-tumor DNA by means of CGH (Kallioniemi et al, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed *'N M , and is determined as follows: 

copy number of target gene (app, myc, ccndl, erbB2) 

N = • — — — ■ . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-EImer Applied Biosystems, Foster City, CA). 

Primers were purchased from DN Agency (Malvern, PA) and 
probes from Perkin-EImer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, Micro Amp optical tubes, 
and MicroAmp caps were from Perkin-EImer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
rig of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real -time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10~ 7 (10 5 copies of each gene) to 
10" 10 (10* copies). This series of diluted PCR products was 
aliquoted and stored at -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 ng). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCl 2 , 1 .25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 uM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-EImer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates C t and determines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbBl proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (2 1 q2 1 .2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as I0 2 copies or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference genes (app and alb) 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure 1 - Albumin {alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from 10 s (A9), 10 4 (A7), 10 3 (A4) to 10 2 (A2) and a no-template control (Al). Cycle number is plotted vs. change in normalized reporter signal 
(ARn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. C, (threshold cycle) represents the fractional cycle number at which a significant increase in Rh above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom: 
Standard curve plotting log starting copy number vs. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app. 21q21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et ai, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1 .6, mean 1 .06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and erbB2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 1 8 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1 .3 (mean 0.9 1 ±0.19) for erbB2. Since N values 
for myc, ccndl and erbB2 in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erbi?2 gene dose in breast-tumor DNA 

myc, ccndl and erbBl gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbBl (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 2 to 15.1 for erbBl, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table IT represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (Tl 18). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbB2 and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbBl in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbB2 copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of erbBl). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

. Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southem-blot analysis of myc, ccndl and erbB2 amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N S 5). However, there were cases (1 myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND crbB2 GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification level (N) 




<0.5 


0.5-1.9 2-4.9 




myc 

ccndl 

erbBl 


0 
0 

5 (4.6%) 


97 (89.8%) 11 (10.2%) 
83 (76.9%) 17(15.7%) 
87 (80.6%) 8 (7.4%) 


0 

8 (7.4%) 
8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et ai, 1994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et ai, 1 990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 1 00 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C ( to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C, value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C t ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
em blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
1996; Slamon et ai, 1989). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature. (/) Chromosome regions 4qll-ql3 and 21q21.2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (//) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Bems et ai, 1 992; Borg et ai, 1 992). (///) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbBl amplifica- 
tion, confirming the findings of Borg et ai ( 1 992) and Courjal et ai 
(1997). (iv) The maxima of ccndl and er6B2 over-representation 
were 1 8-fold and 15-fold, also in keeping with earlier results (about 



GENE AMPLIFICATION BY REAL-TIME PCR 



665 



Amplification - CCND1 












1.400 - 
1.200 - 










i nnfl - 

1 .WW 

0-800 - 






iff 




0.600 - 








0.400 - 
0.200 - 
.0.000 - 
-0.200 - 










rrr 


1 I 1 1 ! 1 1 


j 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 





1 — Samples 



El FAM - A8 
El FAM - E12 
El FAM - Gil 



Viewer: | &Rn (B... ▼] 



0 2 4 6 6 10 12 H 16 18 20 22 24 26 28 30 32 34 36 38 40 Reporter ["TasT^I 

Cycle 



Amplification - ALB 




-Samples 



FAM - 64 
El FAM - C6 
I FAM - C8 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 

Cycle 



Viewer: | ARn (B..."^l 
Reporter | FAM ▼ | 



Tumor 
I T118 



CCND1 



ALB 



Copy number C*- Copy number 



27.3 



4605 
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25.2 



10092 



T145 



22.1 



125892 



25.6 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 1 8 (E 1 2, C6, black squares), Tl 33 (G 1 1 . B4, red squares) 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment. Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Berns et ai, 1992; Borg et ai, 1992; Courjal et 
ai. 1997). (v) The erbB2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et ai, 1 995; Deng et ai. 1 996; Valeron 



et ai, 1996). Our results also correlate well with those recently 
published by Gelmini et ai ( 1 997), who used the TaqMan system to 
measure erbBl amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystems) which only allows end- 
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TABLE II - EXAMPLES OF vend) GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS 1 



Tumor 




ccndl 






alD 




Nccndl/atb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


Til 8 


'4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




121722 






7933 









'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Nccndlfalb) is determined by dividing the average ccndl 
copy n amber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erbBl) observed by means of real-time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamon era/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbBl (but not of the other 2 proto- 
oncogenes) in several tumors; erbBl is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy, and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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amplification in certain tumor types but does so in others, then parallel monitoring of gene 
amplification and gene product over-expression enables more accurate tumor classification and 
hence better determination of suitable therapy. In addition, absence of over-expression is crucial 
information for the practicing clinician. If a gene is amplified but the corresponding gene 
product is not over-expressed, the clinician accordingly will decide not to treat a patient with 
agents that target that gene product. 

7. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information or belief are believed to be true, and further that 
these statements were, made with the knowledge that willful false statements and the like so 




made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 

By: ,4v-> A4a^^- ) Date: * )\S j 
Avi Ashkenazi, Ph.D. ' 
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Declaration of paul polakis, Ph.D. 

>olakis, Ph.D., declare and say as follows: 

I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2 I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1 999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identify ing tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells; 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins", When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the Course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding-normal human cells. To date, we- 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. • 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increasedlevel of mRNA in a tumor cell relative 

.. to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of th 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
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SECOND DECLARATION OF PAUL POLAKIS, Ph.D. 

I, Paul Polakis, Ph.D., declare and say as follows: 

I am currently employed by Genentech, Inc. where my job tide is Staff 
Scientist. 

Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genemech's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers that find use 
as targets for both the diagnosis and treatment of cancer in humans. 

As I stated in my previous Declaration dated May 7, 2004 (attached as 
Exhibit A), my laboratory has been employing a variety of techniques, 
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s); 

In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B. In Exhibit B, fc V means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

As shown in Exhibit B, of the 31 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level , 28 of them (i.e., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed thai in the vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 




6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of rhe relevant scientific literature, it is my considered scientific 
opinion thai for human genes, an increased level of mRNA in a tumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative ro 
the normal tissue. In fact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 
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DECLARATION OF PAUL POLAKIS, Ph.D. 



I, Paul Polakis, PhD., declare and say as follows: 

1 . I was awarded a PhD. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job tide is Staff 
Scientist. Since joining Genentech in 1999, one of ray primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are ahundantiy expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 

expressed trom tnese differentially expres$^^ne^anscripts^nd~haYe-used-these 

antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and farther that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressive promoter, and (it) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24J. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven- transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3)3 (GSK-30) resulting in an increase in 
0-catenin levels. Stabilized 0-catcnin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
/3-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or 0-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), andsiamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP- 1 
and WISP-2, and a third related gene, WISPS. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH, SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor, CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 

Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF 100777, 
AF100778, AF100779, AF100780, and AF100781). 
+To whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 u.g of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 ug 
of po!y(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-l were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-l 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 yM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-l or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2<a«> where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The W7SP-specific signal was 
normalized to that of the glyceraIdehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-l and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-l and WISP-2, were differentially 
expressed, being induced in the C57MG/ Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. \A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on 0-catenin levels (13, 14). Expression of WISP-l was 
up-regulated approximately 3-fold in the C57MG/ Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-l were isolated and the 
sequence compared with mouse WISP-l. The cDNA sequences 
of mouse and human WISP-l were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of « 40,000 (Af T 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. Z4). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of 27,000 (Af r 27 K) (Fig. IS). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fio. 1 . WISP-l and WISP-2 are induced by Wnt- 1 , but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-l (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 fig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP- 1- specific probe 
(amino acids 278-300) or a 190-bp M75P-2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human 0-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human W7SP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-L 

Identification of WISP-3. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Ebnl gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-0 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overex pressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human W/S/'s was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of W1SP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-2 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-2, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. {A, C, E, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and &), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP : 2 expression are shown in E-H. At low 
power (£ and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 1631] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-2 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4- fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 





Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 /*g) 
digested with EcoRI (WISP-1) or Xbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP- 1 , WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fio. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 



DISCUSSION 



One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three cf the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., ^catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through j3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such asTGF-j3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v ftt serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor ceils or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor ceils 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-] gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the. apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
7 transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and 0-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 
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ABSTRACT The consistent cytogenetic translocation of 
chronic myelogenous leukemia (the Philadelphia chromosome, 
Ph 1 ) has been observed In cells of multiple hematopoietic 
lineages. This translocation creates a chimeric gene composed 
of breakpoint-cluster-region (bcr) sequences from chromosome 
22 fused to a portion of the a W oncogene on chromosome 9. The 
resulting gene product (PHO^ 1 ) resembles the transforining 
protein of the Abelson murine leukemia virus in Its structure 
and tyrosine kinase activity. P210 e ** bl is expressed in Ph r - 
positive cell lines of myeloid lineage and in clinical specimens 
with, myeloid predominance. We show here that Epstein-Barr 
virus-transformed B-Iymphocyte lines that retain Ph 1 can 
express P210 t> * bI . The level of expression in these B-cell lines is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation in the level of bcr-abl mRNA 
produced from a single Ph 1 template. 



Chronic myelogenous leukemia (CML) is a disease of the 
pluripotcnt stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B -lymphoid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6). Although the precise 
translocation breakpoints are variable, an RNA-splicing 
mechanism generates a very similar 8-kilobase (kb) mRNA m 
each case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P210 c " tW (10-13), with an anuno-terbinal segment derived 
from a portion of the exons of bcr On chromosome 22 and a 
carboxyl-terminal segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl and the resulting P210?'* bl is similar to the 
structure of the Abelson murine leukemia virus gag-dbl 
genome and resulting P160 v ** b ! transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the c-abl 
gene product (15). 
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In concert with structural modification of the amino- 
terminal portion of the abl gene, increased level of expression 
has been implicate^ in activation of c-abl oncogenic poten- 
tial. Myeloid and erythrbid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRNA and P210 c " aW than 
the c-abl mRNA forms (6 and 7 kb) and P145 c -* w gene product 
(5, 8, -9, Ml), The higher level of expression of the chimeric 
bcr r abl message in acute-phase cells is not likely to be solely 
due to the presence of.the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6.7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5, 6).. 

We have analyzed a series of, Epstein-Barr virus-immor- 
talized B-lymphpid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of t>h l always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that cell lines that retain Ph 1 do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 c * t ! 1 suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-abl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labellngs. Epstein-Barr virus-transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report* 
ed (ljS). The cell lines are designated according to patient 
number, karyotype, and lineage. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line, 9;22 translocation (Ph 1 ), cell line ?3; and SK-CML7BN- 
2 refers to B-cell line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just prior to analysis for abl 
protein and RNA. Cells were maintained in RPMI .1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern of In vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Cells (1 .5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 
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supplemented with 5% dialyzed fetal bovine serum. Cells 
were then resuspended in 2 ml of the minimal medium. 
Labeling was started with the addition of [ 32 P]orthophos- 
phate (1 mCi/ml; ICN; 1 Ci - 37 GBq) and continued at 37°C 
for 3-4 hr. 

Immunopredpitation and Immunoblotting. Immunoprecip- 
itations were earned out as described (10). Cells (1.5 X JO 7 ) 
were washed with phosphate-buffered saline and extracted 
with 3-5 ml of phosphate lysis buffer (1% Triton X-100/0.1 
NaDodSO 4 /0.5% deoxycholate/10 mM Na 2 HP0 4 , pH 7.5/ 
100 mM NaCl) with 5 mM EDTA and 5 mM phenylmethyl- 
sulfonyl fluoride. Extracts were clarified by cenirifugation 
and precipitated with normal or rabbit anti-abl sera (anti- 
pEX-2 or antj-pEX-5) (17). The precipitated proteins were 
electrophoresed in a NaDodS0 4 /8% polyacrylamide gel. 
32 P-labeled proteins were detected by autoradiography. 
Alternatively, abl proteins were detected by immunoblotting. 
Extracts from unlabeled cells were clarified, and proteins 
were concentrated by immunopredpitation with rabbit anti- 
sera against aW-encoded proteins [anti-pEX-2 and anti-pJJX- 
5 combined (17)] and then fractionated in 8% acrylamide gels. 
The proteins were transferred from the gel to nitrocellulose 
filters, using protease-facilitated transfer (18). The abU 
encoded proteins were detected using murine monoclonal 
antibodies as a probe and peroxidase-conjugated goat anti- 
mouse second stage antibody (Bio-Rad) for development. 
Rabbit antisera and mouse monoclonal antibodies to abl 
proteins were prepared using bacterially expressed regions of 
the v-abl protein as immunogens (17, 19). Anti-pEX-2 anti- 
bodies react with the internal tyrosine kinase domain and 
anti-pEX-5 antibodies react with the carboxyl-terminal seg- 
ment of the abl proteins. 

RNA Analysis. RNA was extracted from 10 s cells by the 
NaDodS0 4 /urea/phenol method (20). Polyadenylylated 
RNA was purified by oligo(dT) affinity chromatography. 
Samples were electrophoresed in a \% agarose/formalde- 
hyde gel and transferred to nitrocellulose, abl RNA species 
were detected by hybridization with a nick-translated v-abl 
fragment probe (21). 

DNA Analysis. DNA was prepared from 5 x 10 7 cells of 
each cell line and processed for Southern blots with a v-aW 
probe as described (21). 

RESULTS 

Variable Levels of PHO*"" Are Detected In Ph^Posltive Cell 
lines, Fh l -positive and Ph l -negative, Epstein-Barr virus- 
transformed B-iymphocyte cell lines derived from the same 
patient were examined for P210 c ^ w synthesis by immuno- 
precipitation of [ 3i P]orthophosphate-labeled cell extracts 
with anti-abl sera (Fig. 1). The normal c-aW protein P145 c * w 
was detected at a similar level in multiple Ph l -positive and 
Ph l -negative cell lines. P210 c ' abi was only detected in the 
Ph l -positive cell lines because the bcr-abl chimeric gene 
which encodes P210 C *** )I resides on the Ph 1 (4, 5, 11; 13). The 
level of P210 eabl was about 4- to 5-fold higher than the level 
of P145 c ^ w in the SK-CML7Bt-33 cell line (Fig. 1A, +). The 
Prepositive erythroid-progenitor cell line K562 (C) showed 
a level of P210 c - kbl about 10-fold higher than P145 c * bl . 
However, the level of P210 mW was about one-fifth that of 
P145 c ^ bl in the Ph l -positive SK-CML16BM ceU line (Fig. IB, 
+). Comparison of different autoradiographic exposures 
roughly indicated that the level of P210 c " tb! varies over a 
20-fold range between these Ph^positive B-cell lines. Anal- 
ysis of four additional Ph l -positive B-cell lines demonstrated 
that the level of P210 c ** w fell into two general classes; some 
cell lines had a level of P210 c ' abl similar to SK-CML7Bt-33 
and others had the low Level similar to SK-CML16BM (Table 
1). This differs from previous studies with Ph ! -positive 
myeloid cell lines and patient samples derived from acute- 
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Fio. 1. Detection of variable levels of P210 c * u in Ph'-posiuve 
B-cell lines. Production of PUS** 4 and P210 C " M in Epstein-Barr 
virusttransformed B-cell lines derived from a blast-crisis (A) and a 
chronic-phase (B) CML patient was examined by metabolic labeling 
with ["PJorthophosphate and immunoprecipitation. Ph'-ncgative 
(-) and Ph l -positive (+) ceU lines derived from each patient were 
analyzed. The Ph^ncgative cell line in A,- is SK-CMJL7BN-2 and in 
B t - is SK-CML16BN-1. The Ph l -positive ceU line in A f + is 
SK-CML7Bt-33 and in is SK-CML16Bt-l. The K562 cell line, a 
Prepositive erythroid progenitor cell line spontaneously derived 
from a blast-crisis patient (33), is represented in C. CeUs (1.5 x 10 7 ) 
were metabolically labeled with 2 mCi of ["PJorthophosphate for 3-4 
hr and then were extracted and clarified by centrifugation. Samples 
were immunoprecipitated with control normal serum Oanes 1), 
anti-pEX-2 Oanes 2), or anti-pEX-5 (lanes 3) and analyzed by 
NaDodS0 4 /8% PAGE followed by autoradiography with an inten- 
sifying screen (3 days for A and C, 10 days for B). 

phase CML patients, in which P210 c "* w was detected at a 
10-fold higher level than FUS*** (refs. 10 and 11; Table 1). 
There was no large difference in level of chimeric mRNA and 
P210 c ** bl expressed in four myeloid/erythroid-lineage Ph l - 
positive cell lines (K562, EM2, EM3, CML22, and BV173; 
refs. 9 and 11), despite a 4- to 5-fold amplification of 
aW-related sequences in the K562 cell line. 

Detection of different levels of P210 c-abl in Fig. 1 could be 
due to decreased phosphorylation of P210 c "* w , a lower level 
of P210 6 "* 1 synthesis, or altered stability of the protein. To 
help distinguish among these possibilities, the steady-state 
level of P210 c * ttM in the cell lines was assayed by immuno- 
blotting. The results show that SK-CML7Bt*33 (Fig. 2A t +) 
had a higher level of P210 c " w than P145, similar to the results 
with metabolic labeling (Fig. 1). We did not detect P210 c " Bbl 
by immunoblotting with 2 x 10 7 cells of line SK-CMLfiBt-3 
(Fig. 2B, +). Reconstruction experiments using dilutions of 
cell extracts showed that we could detect about 5-10% the 
level of P210°* w expressed in the K562 cell line (data not 
shown). We infer that the steady-state level of P210 c ** w in 
SK-CML8Bt-3 is lower than the level in SK-CML7Bt-33 by 
a factor of at least 10. The level of P210 c '* bl detected in these 
assays correlated with the amount of P210 c " aW tyrosine kinase 
activity that could be detected in vitrp (data not shown). 

Different Levels of P210 c -* bl Are Reflected in the Amount of 
Stable bcr-abl mRNA. To identify the basis for detection of 
variable levels of P210 c '* bl , we examined the production of 
the abl RNA. RNA blot hybridization analysis using a v-abt 
probe (Fig. 3) showed that the normal 6- and 7-kb c-abl 
mRNAs were present at a similar level in Ph x -positive and 
-negative cell lines derived from different patients. However, 
the 8-kb mRNA that encodes P210 c "* w was detected at a 
10-fold higher level in SK-CML7Bt-33 (Fig. 3A f +) than in 
SK-CML16BM (B t +), which correlated with the relative 
level of P210 c - ebl detected in each cell line. Analysis of 
additional cell lines demonstrated that the level of 8-kb RNA 
directly correlated with the ievel of P210 c -* bI (Table 1). The 
variation in level of 8-kb RNA detected in these cell lines was 
not due to loss or gain of Ph 1 , because cytogenetic analysis 
confirmed the presence of Ph 1 in these cell lines (ref. 16 and 
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Table 1. Relative levels of bcr-abl expression in Epstein-Barr 
vinis-immortalized B-cell lines and myeloid CML lines 
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+++++ 
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EM2 


BC 
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•Cell lines derived from CML patients by transformation with 
Epstein-Barr virus as described (16). Names of cell lines indicate 
patient number and Ph l status: SK-CML7BI indicates a cell line 
derived from patient 7 that carries the 9;22 Ph 1 translocation; N 
indicates a normal karyotype. Myeloid-erythroid cell lines (K562, 
EM2, and BV173) are described in previous publications (9, 11, 22, 
33). 

'Status of patient at the time cell line was derived. BC, blast crisis; 
Acc, accelerated phase. 

♦Presence (+) or absence (-) of Pb l as demonstrated by karyotypic 
or Southern blot analysis. 

tP210 c * bl detected as described in legend to Fig. 1. B<el) lines 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (+++) than levels of P145. Chronic- 
phase-derived cell lines had P210 levels lower than or just equivalent 
(+) to the level of P145. Myeloid and erythroid lines had levels of 
P210 5- to 10-fold higher than P145 (+++++). 

f Eight-kilobase bcr-abl mRNA detected as described in legend to 
Fig. 2. Symbols: ±, borderline detectable; .+++++, level of 8-kb 
mRNA 5- to 10-fold higher than that of the 6- and 7-kb c-abt mRNA 
species; + + + , level of 8-kb mRNA 3- to 5-fold higher than that of 
the 6- and 7-kb species; +, a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of aW-related sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of M sequences, as previously reported (22, 23). 
These combined data suggest that differential bcr-abl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 e> * w detected. This could be mediated 
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Fio. 2. Analysis of steady-state abl protein levels by immuno- 
blotting. Cell extracts prepared from 2 x 10 7 cells of lines SK- 
CML7BN-2 (A.-), SK-CML7BI-33 (A,+), SK-CML8BN-10 (*,-), 
and SK-CMUSBt-3 (B,+) were, concentrated by immunoprecip- 
itation with anti-pEX-2 plus anti-pEX-5. Samples were then electro- 
phoresed in a NaDodS0 4 /8% polyacrylamide gel and transferred to 
nitrocellulose, using protease-facilitated transfer (18). abl proteins 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX-2 and pEX-5 <sM-protein fragments produced in 
bacteria (19) as a probe and a peroxidase-conjugated goat anti-mouse 
second-stage antibody (Bio-Rad) for development. 




Fia. 3. Comparison of abl RNA levels In Ph l -positive and 
-negative B-cell lines. The levels of the normal 6- and 7-kb c-abl 
RNAs and the 8-kb frcr-oWRNA were analyzed by blot hybridization 
using a v-dM probe. RNA was extracted from Ph'-negative lines 
SK-CML7BN-2 (A,-) and SK-CML16BN-1 (B,-), from Ph l -pos- 
hive lines SK-CMUBt-33 (A,+) and SK-CML16Bt-3 (*,+), and 
from line K562 (C,+) by the NaDodSd 4 /urea/phenol method (20). 
Polyadenylylated RNA was purified by oligo(dT) affinity chroma- 
tography, and 15 of each sample was electrophoresed in a 1% 
agarose/formaldehyde gel and then transferred to nitrocellulose. The 
blotted RNAs were hybridized with a nick-translated v-oW fragment 
probe (21) and then autoradiographed for 4 days. 

by factors influencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Several lines of evidence suggest that formation of Ph 1 is not 
' the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph 1 (1). This observation, 
coupled with studies of G6PD (glucose-6-phosphate dehy- 
drogenase)-heterozygous females with CML that demon- 
strate stem-cell clonality by isozyme analysis among cell 
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Fio. 4. Southern blot analysis of abl sequences in Ph'-positive 
and -negative B-cell lines. High molecular weight DNA (15 /ig) was 
digested with restriction endoauciease BamHl, separated in a 0.8% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized with a nick-translated , 2 .4-kb Bgl II v-a W 
fragment (1.5 x 10 s cpm/Mg; ref. 21) and exposed for 4 days. (A) 
Autoradiogram of aW-speclfic fragments in cell lines HL-60 Cane 1), 
EM2 (lane 2), K562 (lane 3), SK-CML7BI-33 (lane 4), SK-CML8Bt-3 
Cane 5), SK-CMLlGBt-1 (lane 6), SK-CML21Bt-6 (lane 7), SK- 
CML35BI-2 (lane 8), SK-CML7BN-2 (Une 9), SK-CML8BN-2 Gane 
10), and SK-CML35BN-1 Oane 11). (B) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose, showing the level of 
variation in amount of DNA loaded per lane. 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
with tihe marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph 1 , a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the \-abl gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-aW 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of fran j-acting factors that occur during evolution of 
the disease. Our analysis of P210 c * bl and the 8-kb mRNA in 
Epstein-Barr virus-transformed Ph l -positive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c "* w one-fifth that of P145 c " lbf ( as detected by metabolic 
labeling with [ 32 P]orthophosphate and immunoprecipitation 
(S.C, O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chrome- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases • relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 c * tbl expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Ph l -positive B-cell lines we have examined 
(Table 1), we have seen higher levels of P210 c " bl in those 
derived from patiu v. - at more advanced stages of the disease. 
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It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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p.ui a; Ham« Pro(eora.e analysis: Biological assay or data archive? 

SureoP. Cygl 

Daniel Flgers In (his review we -examine the current slate of prpteome analysis. There we 

Ruedi Aebersotd ^ cc ma j n discussed: why it is necessary to study proteomes; how pro- 

teomes can be analyzed with current technology; and how proteome analysts 
Department of Molecular ^ n ^ t use< j to enhance biological research. We conclude that pro teome anal* 

Biotechnology, Itolrerslty of ysil a ao ^^1^1 tool in the understanding of regulated biological systems. 

Washington, Seattle, WAV USA • . Q unwi technology, white still mostly limited to (he more abundant proteins, . 

enables the use of proteome analysis both (o establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysts, in future biological 
research will continue to be enhanced by further improvements In analytical 
technology. i - 

Contents resolution two-dimensional gel electrophoresis (2-DB), 

. detected in the gel and identified by their amino add ■ 

1 Introduction , - j™2 sequence. The ease, sensitivity and speed with which gcl- 

2 Rationale for proUome analysts . 1861 5e p af aled proteins can be identified by the use of recently 

2.1 Correlation between raRNA and protein developed mass spectrometric techniques havo cjramatl- 
expression levels ' ' ' 7 .1 ' ■" * r ' 1 tally increased the interest in proteome technology. One 

2.2 Proteins are dynamically modified and pro- • of me mosl allrBcUvc features of such analyses is that com- 

I8W . biQjogjcaj aygtemi can potentially be studied in their 

2.3 Proteomes are dynamic , and retled the entirely, rather than as a multitude of individual compo- 
site of a biological system 1863 ntnll ^ makes u faf eajler to uncovef , hc mQay 

3 Description and assessment of current pro- ^ w6 often obscurC| ro |aUonsnips between mature 
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nol °fl' . ferent organisms and. cell types. Microbial pro teome pro- 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
(issue or differentiated cell (l). In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 

Cenupoo4«cc«: Professox Ruedi Acbericld, Dtpi/lmcnt of Moltculli 
Biotechnology. Univtrrity of With in (ton. Box 3177)0,. SulUe, WA, 
98195, USA fTtl: +20£4S5-U)3; Fn: ♦ 1 06-615 -63 92; E-miil: ruedi 
eu.vifhiogion.edu) / , 

Abbrtrltlloas: CID. collision-induced disrocialion; -MS/MS. Iifldcm 
m*a ipeclromcvy; SAGg. seriil anityiU of icoe eiprexsioo 

Kcprttrdf: Prolcotne / TVo-<}uncoJiooiJ pofyecrfUrijide |cJ tlcclra- 
phorcsii / Tudcm miu ipectfoaut/y 



Z Rationale for proteome analysis 

The dramatic growth in both ibe number of genome 
projects apd the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences ([IS- 17]}. The description of the 
stnte of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent- technical 
advances including the development of differential dts* 
piay-PCR (18), cDNA microarr&y and DNA chip lechno- 
togy [19, 20) and serial - analysis of gene expression - 
(SAGE) (21, 22), il is now feasible to establish global and 
quantitative mRNA expression maps of ceils and tissues, 
in which the sequence of all the genes is' known, at a 
speed and sensitivity which is not matched by current 
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protein analysis technology. Given the longstanding 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome 
•projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
. profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component In the . 
comprehensive analysis of bio logical systems, (j) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) pro to ins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, arid (iii) proteomes 
axe dynamic and reflect the state of a biological system. 



Z.I Correlation between mRNA and protein. expression 
tev«U 

Interpretations of quantitative mRNA expression profiles 
frequently Implicitly or explicitly assume that for specific 
genes the transcript levels are Indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Sa'echaromycu ccrcvislac 
growing at mid-log phase (S. P. Oygi ct a/., submitted for 
publication): mRNA expression levels were calculated 
from published SAGE frequency tables (22]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
■ of (ho protein spots separated by high resolution 2 l DE 
. and mass spectrometry identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general tread but no 
strong correlation between protein and transcript levels 
(Pig. 1). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold, Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Oygi 
ct cL t submitied). These results suggests that even for a 
population of genes predicted to be relatively . homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translationally modified by glycosylation, phosphor- 
ylation, prenyl at ion, ecylation, ubiquilination or one or 
more of many other modifications (23) and many pro- 
teins are only functional if specifically associated or com- 
plezed with other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cofactors. Frequently, 
modifications are dynamic and reversible and may alter 
trie precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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. arc important indicators for the slate of the system. The 
type of protein modification and the sites modified at a 
specific cellular state, con usually not be determined 
from the gene sequence alone. 

23 jfroteomes are dynamic sod reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and. 
quantitatively different proteomes. Specific stapes of the 
cell cyde and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and 'pathological conditions represent cellular states' 
which are characteriied by significanliy 'different pro- 
teomes. The proteome, in principle, also reflects events 
that arc under traoslational and post-treunslalionai con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state bf a cell or tissue, provided that the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysts of biomolec- 
ular systems, it is evident that tho analysis of mature pro- 
tein products in cells is essential as there arc numerous 
levels of control of protein synthesis; degradation, 
processing and modification, which are only apparent by 
direct protein analysis. . . 



. 3 Description and assessment of current proteome 
analysis, technology : - 

3.1 Technical requirements of proteome technology 

In biological systems (he level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of protein j therefore requires the detection and 
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quantitation of Ihc proteins which constitute the system, 
and analysis of differentially processed forms. There arc 
a number of inherent difficuliics in protein analysis 
which complicate these tasks/ First, proteins cannot be 
amplified, ft is possible to produce large amounts of a 
particular protein by ovcr-cxpressibn in specific cell sys- 
tems. However, since many proteins are dynamically 
posltranslationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult, to purify from 
(he native source sufficient amounts , of a protein for 
analysis. From a technological poini of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins art modified and pro- 
cessed poswranslationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to he determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species, available for analysis. The com- 
plexity and dynamics of posl-translational protein edit- 
ing thus significantly complicate] proteomc studios, 
Third, proteins vary dramatically with respect to their 
solubility In commonly used solvents. There are few, if 
any; solvent conditions to which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both- protein purification and 
solubility have to be achieved under the samc.condi? 
lions. Detergents, In particular sodium, dodecyl sulfate 
(SDS), arc frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS. polyacrylamide gel.cleclnfc: 
phoresis (SDS PAG B) over other protein separation 
"techniques. Thus, SDS-PAGB and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the. most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity docs not necessarily heed to be maintained. 
LaxUy, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize ail of these must use methods which are as 
rapid as possible to allow completion ' of the project 
■ wllhla a reasonable time .frame. Therefore, a successful, 
general pro t comics technology requires high sensitivity, 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-0 electrophoresis — mass spectrometry: a common 
Imp I em eolation of proteomc analysis 

The most common currently used implementation of 
•proteomc analysis technology U based on the separation 
of proteins by two-dimensional (IEF/SDS-PAOB) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2 -DE, proteins arc first separ-. 
atcd by isoelectric focusing (I BP) and then by SDS- 
PAGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized ai high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are. at present, .the most commonly 
used means of global display of proteins in complex 



samples.' The .separation of thousands of proteins has 
been achieved in a single gel 124, 25] and differentially 
modified proteins are frequently separated.' Due to the 
compatibility of -2-DE with high concentrations, of deter- 
gents, protein denaturing and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteose analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using //-terminal sequencing [26, 27), 
internal peptide sequencing [28, 291, immunoblotting or 
, comigralipn with known proteins [301 Tha recent dra- 
matic growth of large-scale . genomic and expressed 
sequence tig (EST) sequeo.ee databases has resulted try 
fundamental change in the way proteins are identified |y 
their amino acid sequence. Rather than by the tradiUoaal 
methods described above, protein sequences are now fre- 
quently determined by . correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the. information contained in sequence data- 
bases [31-M1. 

There are t number of alternative approaches to pro- 
teomc analysis currently under development There is 
considerable interest In developing a pro tec ma analysis 
stragegy which bypasses 2-DE altogether, because it ts . 
considered a relatively slow and tedious process, and 
because of perceived' difficulties in extracting proteins • 
from the get matrix for analysis. However, 2-DE as a 
starting point for protcome analysis has many .advan- 
tages compared to other techniques available today. The 
most significant ^strengths of the 2- DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the Identi- 
fication of get-separated proteins -is shown in Pig. 2. Pro- 
tein spots detected in the gel are erurymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already Indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice [28, 34] or digestion after elexirot/ansfer out 
of the gel onto a suitable membrane ([29, 35-37) and 
reviewed in (3ft)). In most instances dither technique is 
applicable and yields good -results. The analysis of MS or 
'MS/M.S data is an important step in the whole process 
because MS instruments cart generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins arc identified by correlating the information 
contained in the MS spectra of protein digests or . 
MS /MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro* 
tcio digests by narrow bore or capillary liquid chroma tog- 
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flfvrt 2. Schematic din wo of 4 procedure Tor identification «f |cf- 
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or t com bio* lion of both. 



raphy (39, 40| or capillary electrophoresis [411, the anal- 
ysis of the separated peptides by elcctrospray iontza- 
tioo (ESI) MS/MS, and the correlation of the generated 
peptide, spectra with sequence databases using the 
. SEQUEST program developed at (he University of Wash- 
ington j32, 33). The system automaticaliy- performs the 
following operations:^ particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in. the system at. a parti- 
. cutar time; the selected peptide ion is collided in a colli- 
sion ceU with argon (collision-induced dissociation, 
CID). and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CIO spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
samo mass as the peptide selected for CID;- this cortela-. 
Uoo matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from ' 
which the peptide was derived: there are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQURST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 



3.3 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS. 

tt has been demonstrated repeatedly thai MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amouats of protein, from gels into the 
MS/MS system without significant (oss of sample or 
introduction of unwanted contaminants. We- employ 
three different systems for introducing gej-purified sam- 
ples into an MS, depending on the level of sensitivity 



required. As an approximate guideline, for samples con- 
taining tens of picoraoles of peptides,. LC-MS/MS is 
most appropriate; for samples containing, low picomole ' 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing fem to molts or 
less, CB-MS/MS is the method of choice. 

3,3.1 LC-MS/MS 

The coupling of an MS to' an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has beeo described In detail (42). This system has several 
. advantages If a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated modo using an autosarapler, thus maxi- 
mizing sample throughput and minimizing the heed for 
operator interference. The relatively largo column is 
tolerant of high levels of impurities from either gel prep* 
aralion or sample matrix. Lastly, tf configured with a 
flow-splitter and miero-spnyer [40L analyses can be per- 
formed on a small fraction of the samplp (less than 5%) 
while the remainder, of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions. Such as scwtiUatioo of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra, 

3.3J Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be ' 
achicvc.dby using a capillary LC system with a 100 urn ID 
column rather than a 0.5 nun ID column as referred to 
above. Since very low rjow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent .gradients. . We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables , the formation 
of reproducible gradients at very low flow rates .-(low ■ 
, oL/rain) without the need for (low splitting (A. Ductet 
tt a/., submitted for publication). Using this capillary 
LC-MS/MS system we were able to Identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel (40J. This system is as yet not 
automated and, like all capillary LC systems, is pionc to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3.33 CE-MS/MS 

The highest level of sensitivity for analyzing gel-scp- 
. arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in (he past a solid-phase extraction capillary elec- 
trophoresis (SPE-CEj system which was used with triple 
quadrupole and ion trap BSI-MS/MS systems for the 
identification of proteins at (he low femtomole to sub- 
femtomole sensitivity level [43, 44J. While (his system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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microfabricalcd devices for the introduction of samples 
into BSI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass, iqlb which channels 
of 10-00 um in depth and 5<MQ tun in diameter arc 
eidhed by using pbotoUmography/etchins techniques 
similar to the ones used In "the semiconductor industry. 
(A simplb device is shown in Pig. 3). The channels are 
connected to an external high voltage power supply (4$). 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by eleclrobsmotlc 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
How can be redirected by simply switching the position 
of the electrodes on the dtvicxl Thc direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric Held applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Pig- 4, which shows the mass spectrum of a peptide sample 
representing the (ryptic digest of carbonic anbydrase si 
290 fmol/uL. Each numbered peak indicates a peptide suc- 
cessfully Identified as being derived from carbonic an- 
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hydrase. Some of the unassignea. signals may be chemical j 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used. to identify the 
protein by 'correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
(he samples, to automatically generate CID spectra of 
selected peptide ions and' to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low remtoroole level sensitivity at a 
rate of- approximately one protein per .1 5 rain. . 

3.4 Assessment of 1-DE-MS proteomt. technology 

Using a combination of the analytical techniques de- 
scribed above we have identified^ the 80 protein spots * 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total, cell lysale of the yeast strain. YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteomc analysis 
can penetrate towards the identification of low abun- 
_ dance proteins, we have calculated the codon bias of the 
geoes encoding the respective proteins. Codon bias is a 
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calculated measure of the' degree of redundancy of trip- 
. let DNA codons used to produce each amino- acid In a 
particular gene sequence, it has been shown to be a 
useful indicator of the level of the protein product of a 
particular geao sequence present in a cell (4 ft. The gen- 
. era! rule which applies is thai the higher the value of the 
codon bios calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codoo bias values corresponding (o the proteins identi- 
fied in Pig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified f> 95%) have codon bias values of > 6.2, 
indicating they are highly abundant In. celts. In contrast, 
codon bias values calculated for the cniire yeast genome 
(Pig. 6a) show that the majority of proteins present in 
the pioteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current statu* of proteoma analysis technol- 
ogy. It is clear thai even usiog highly, sensitive analytical 
tech piques_ we are only able to visualize and identify the 



mor© abundant proteins. Since many important regula- 
tory proteins are present only al low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 600b gene products' present in yeast 
cells [16]. Id the analysts of, for example, the proteome 
of any human' cells, (her* are potentially 50000-100000 
gene products [A}). Inherent' limitations on the amount 
of protein thai can be loaded on 2-DB, and the number 
of components that can be resolved, indicate that only 
Ihe most highly' abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE (48). 
Since only those proteins which focus within the narrow 
range will eater the second dimension of separation), a 
much higher sample loading within the desired range' is 
possible. This,, to turn, can lead to the visualization and' 
identification of less abundant proteins. 
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4 Utility or proteomc analysts (or biological 
research 

For the success of proteomics "as 1. mainstream approach 
(0 (be adaiysis of biological systems it is essential to 
define bow proteomc analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of pralcomo-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature Indicates that pioteome anal: 
ysis is used both as a database/data arcbivo, and as a bio- 
logical assay or biological research tool. 

4.1 Tbe proteotse as a database 

Itic use of proteomics as a database or data archive 
essentially entails an attempt to identify ail the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is .relevant for each 
protein. The level of annotation tan, of course, be exten- 
sive. The most common implementation of this Idea is 
the separation of proteins ;by high resolution 2-DE, the 
identification of each detected protein spot and'ttjc 
annotation of the protein spots in a 2- OB gel database 
format. This approach is complicated by the fad that it is 
difficult to precisely define a proteomc and to decide 
which pro (come should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteomc is highly dynamic. Processes 
such as differentia lion, cell activation and'discase can all 
significantly change the pro (come of 1 species. This is 
illustrated in Fig. 7. The figure shows ivo high-resolu-. 



lion 2DE maps or proteins isolated from rat Strum. 
Fig. 7A is from the serum of normal rats, while Fjg. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment .with an inAaamaUon-ciiuing. agent (491 
It is obvious that' the protein patterns arc significantly 
different in several areas-, raising .the- question of exactly 
.which protcome Is being described. 

therefore, a comprehensive proteomc database of a spe* 
• cies or celt type needs to contain all of the parameters 
which describe the state and the. type of the cells from 
which the proteins were extracted as well a? the software 
tools tp search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitative^ 
describing the fate of each protein if specific system! 
and pathways arc activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature* of molecules specifically .Interacting 
with a protein as well as the rate of change of these, 
variables should be . described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are; however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD (50] (accessible at hup:// www.ypd.com) and 
the human 2D- PAGE databases of the Danish Centre, 
for Human Oenorae Research (12) (accessible at blip:// 
biobase.dk/cgi-bin/cclis). While neither can be con- 
sidered complete as not alt of the potential gene pro- 
ducts are identified, both, contain .extensive annotation 
of supplemental .information for many of the spots 
which are positively identified in reference samples/ 



"42' Tne proteome as a blologloJ assay 

the use of proteome analysis as a biological assay or 
research toot represents an alternative approach to inte- 
grating biology with proteomics. Tb investigate the state 
of a system, samples are subjected to a specific pfpeeess 
that allows the quantitative pr qualitative measurement 
of some of the variables which describe the system In 
typical biochemical assays one variable (&;., enzyme 
activity) of a single component (eg., a particular en- 
zyme) is measured. Using proteomics as an assay; mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less Tarwc&ching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, •however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
' ogy not all of the variables which describe the . system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis, as a 
biological assay is when a 2-DE protein pattern gener- . 
ated from (he analysis of an experimental sample is 
compared to an array of reference pattern* representing 
different slates- of the system. under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore, determined by the quantita- 
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tivc comparative analysis of hundreds to a few thousand 
7B proteins. Comparative analysis of the 2- DE patterns fur- 

aJUf ihermorc highlights quantitative and qualitative differ- 

[40). cnC6S io tic protein profiles which correlate with the 

indy stale of the system. For this type of analysis it is not 

icUy * essential that all. the proteins arc identified or even visu> 



aliied, although the results become more informative as 
more proUins are compared. U is obvious, however,: thai 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances thu 
approach by opening up new avenues for experitnenta- . 
lion. 
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Proteome analysis as a biological assay has btcn success- 
fully u«d in Ihc Held of toxicology, to characterize 
disease stales or to study differential activation of ct\\s. 
"Rie approach is limited, of course, by the fact that only 
the visible protein spots ace induced in the assa* and it 
is well lenown Out a substantial bul far from complete 
fraction or cellular proteins are detected if a total cell 
lysate is separated by 2 -DE. Proteins may not be 
delected in 2-DB gels because they are not abundant 
enough to be visualited by the detection method used, 
because they do not migrate within to© boundaries (size, 
pO resolved by the gel, because they are nol soluble 
under the conditions used, or for other reasons-. 

A different way to use protcome analysis as a biological 
essay to define the slate of a biological system is to take 
advantage of the. wealth of information contained* in 
2-DE protein patterns.. 2-DB is referred to as two-dimen- 
sional because of the eteciropborciic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DB pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of. - 
additional data dunciuiorts ore contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with oiher proteins, rato of synthesis or 
degradation indicate the activity stale of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different stales is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this . type of experiment only, a fraction of the cellular 
proteins Is analyzed. Since many regulatory proteins are 
of tow abundance, this (imitation is a concern, particu- 
- tarty in cases In which regulatory pathways are being 
investigated. 

S Concluding remarks 

to this report we have addressed three main issues 
related to protcome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and- described' current protcome technology, and third; 
we hayo analyzed the utility of protcome analysis for bio- 
logical research. II Is apparent thai protcome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their coned identi- 
ties, their relevant state of modification and/or associa : 
Hon and their amounts.. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensiuviry levers and at a high level of auto- 
mation of the analytical' processes. A number of tech- 
nical challenges, however, remain. While it Is currently 
possible to identify essentially any protein spots (hat can 
be visualized by common staining -methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expiesscd proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such' methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 
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We have outlined the two principal ways protcome anal- 
ysis is currently being used lo intersect with biological 
research projects; the protcome as a database or data 
archive and protcome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current prolepme data- 
bases, typically are limited to one cell type and ono slate 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of protcome analysis as a 
biological assay can provide a wealth of toforraatiori, but 
it is limited to the proteins detected and ia therefore not 
truly protcoroe-wide. These limitations in prolco rates arc 
to a large extent a reflection of the fad that proteins In 
their fully processed form cannot easily be amplified and 
are therefore cflirlcult to isolate in amounts sufTldentJor 
analysis or experimentation. Trie fad that' to daiexno 
complete protcome has been described further attest? to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteomc analysis wilt eventually 
become attainable. ./ 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0rntoft$§f Thomas ThykjaerU, Frederic M. WaldmanJI, Hans Wolf**, 
and Julio E. Cells** 



Gain and loaa of chromosomal material Is characteristic 
of bladder cancer, as well as malignant transformation in 
general, The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Mere we have at- 
tempted to address this question In pairs of non-invasive 
and Invasive human bladder tumors using a combination 
of technology that Included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 



phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there Js a correlation between 
DNA copy numbers and gene expression In highly amplified 
areas (2), and studies of Individual genes In solid tumors 
have reveaJed a good correlation between gene dose and 
mRNA or protein levels In the case of c-erb-B2, cyc/ln d1 , 
emsl, and N-myc (3-5). However, a high cyclln D1 protein 
expression has been observed without simultaneous an> 



two-dlmensional gel el«ctrophoresls(the resuits showed^ pilf'catlon (4), and a low level of c-myc copy number In- 



that there Is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This ef- 
fect depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fokJ gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with lose of DNA, on the other hand, 
showed either reduced or unaltered transcript levels,; Be- 
cause most proteins resolved by two-dimensional gels 
ere unknown It was only possible to compare mRNA and 
protein alterations iarelatfvely few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.006) between transcript alterations and 
protein levels. The Implications, as weH as limitations, 
of the approach are discussed. Molecular £ Cellular 
Proteomlcs 1:37-45, 20QZ 



Aneuploldy Is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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crease was observed without concomitant c-myc protein 
overexpresslon (6). 

In human bladder tumors, karyotyping, fluorescent In situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non -Invasive pTa transitional coll carcinomas fTCCs), 
this Includes loss of chromosome 9 or parts of It as well as 
loss of Y In maJes, In minimally Invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q~, 11p- ( 1q+, 
11q13 + , 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas Involved often 
contain many genes, majcing meaningful predictions of the 
functional consequences of losses and gains very difficult 

In this Investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with 
gene expression profiling techniques (mlcroarrays and pro- 
teomlcs) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled a/ter Informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 

1 The abbreviation* used are: CGH, comparative genomic hybnd- 
izatkxi; TCC, transitional cell carcinoma; LOH, loss of heterozygosity; 
PA- FAB P, psoriasis-associated fatty acid-bmding protein; 20, 
two- dimensional. 
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Fw. 1. ONA copy number and mRNA expression level. Shown from left to right are chromosome (Ch/.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome, A, expression of mRNA In Invasive tumor 733 as 
compared with the noninvasive counterpart tumor 335. 8, expression of mRNA In invasive tumor 827 compared with the non- invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal ONA Is shown along the length of the chromosome 
(/eft). The bold curve in the ratio profile represents a mean of four chromosomes and la surrounded by thin curves indicating one standard 
deviation. The centra/ verticaljine (brpAen) Indicates a ratio vaiue of 1 {no change), and the vertical lines next to it (dotted) indicate a ratio of 
0.5 {/eft) and 2.0 (r/onf). In chromosomes where the non- invasive tumor 335 used for comparison showed alterations In DNA content the ratio 
profile of that chromosome is shown to the right of the Invasive tumor profile. The colored bars represents one gene each, Identified by the 
ruming numbers above the bars (the name of the o^ne can be seen at www.MDLOK/sdata.html). The bars Indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-Invasive counterpart; >2-fokJ 
Increase (biacfr), >2-foid decrease (blue), no significant change (onjnoe). The bar to the far right, entitled expression shows the resulting change 
in expression along the chromosome; the colon indicate that at least half of the genes were up-regulated {black), at least half of the genes 
down-regulated (b/ue), or more than half of the genes are unchanged [orange). If a gene was absent In one of the samples and present in 
another, It was regarded as more than a 2-foW change. A 2-fold level was chosen as mis corresponded to one standard deviation in a double 
determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive Into su&mucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Prepara Von — Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodtum-guanldlnium thiocyanata 
solution and stored at -80 *C. Total RNA was isolated using the 
RNAzol 9 RNA Isolation method (WAK-Chemie Medical GMBH). 
poly(A)~ RNA was Isolated by an oligo(dT) selection step (OUgotex 
mRNA kit; Qiagen). 

cRNA Preparation- 1 of mRNA was used as starting material. 
The first and second strand cONA syntheses was performed using the 
Superscript® cho»ce system (Invrtrogen) according to the manufac- 
turer's Instructions but using an ottgo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription kit (Ambton), Biottn-laoeted CfP and 



UTP (Enzo) was used, together with unlabeled NTPs In the reaction. 
Following the In vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Army Hybridization and Scanning— Array hybridization and scan- 
ning was modified from a previous method (13). 10 ng of cRNA was 
fragmented at 94 *C for 35 min In buffer containing 40 nut Tris 
acetate, pH 8.1 , 100 miu KOAc, 30 mu MgOAc. Prior to hybridization, 
the fragmented cRNA in a 8x SSPE-T hybridization buffer (1 m Nad. 
10 mM Tris, pH 7.8, 0.005 % Triton), was heated to g5 *C for 5 min, 
subsequentfy cooled to 40 *C, and loaded onto the Arrymelrix probe 
array cartridge. The probe array was then incubated for 1 6 h at 40 *C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6* SSPE-T at 25 "C followed 0y 4 washes in Q.5x SSPE-T 
at 50 *C. The biotlnytated cRNA was staJned wfth a streptavidin- 
phycoerythnn conjugate, 10 ^g/ml (Molecular Probes) in 6x SSPE-T 
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Fio. 1 — continued 



for 30 min at 25 °C followed by 10 washes in 6x SSPE-T at 25 9 C. The 
probe arrays were scanned at 560 nm using a confocai laser scanning 
microscope (made for Asymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrtx gene 
expression analysts software. 

MfcnysatettHo Analysis — Mlcrosatellhe Anaiysls was performed as 
described previously (14). Mlcrosatellites were selected by use of 
www.nobl.nlm.nlh.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DMA was extracted 
from tumor and Wood and amplified by PCR in a volume of 20 mJ for 36 
cycles. The amplicons were denatured and etectrophoresed for 3 h In an 
ABI Prtem 377, Data were collected in (he Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared with Wood. 

Protoomic Analysis —fCCs were minced into smaJt pieces and 
homogenized In a small glass homogenlzer In 0.5 ml of lysis solution. 
Samples were stored at -20 °C until us*. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15 r 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were Identified by a combination of procedures that included 
microsequencing. mass spectrometry, two^imensionaJ gel Western 
immurwolotting. and comparison with the master two-dimens^onaJ gei 
image of human keratinocyte proteins; see biobase.dk/cgi- bin/cetis. 

CGH— Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Flu or esc ewiabeted tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 /ig) were 
denatured at 37 *C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
the slides were counterstalned with 0. 15 *tg/ml 4 ,6- dlamldl no- 2 -phe- 
nyl Indole In ah anti-fade solution, A second hybridization was per- 
formed for ail tumor samples using fluo rose ©in-labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during (he initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Bed-labeled normal DNA, Digital image anaiysls was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
green:red fluorescence Intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green: red fluorescence Intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-dtamidino-2-phenyiindole band- 
ing patterns. Onry images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosome a. and hoterochro- 
matic regions wore excluded from the analysis. 

RESULTS 

ComparativQ Genomic Hybridization -The CGH analysis 
identified a number of chromosomal gains and losses in the 
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TASL£ I ' 

Correlation between alterations detected by CQH and by expression monitoring 

Top, CGH usad as Independent variable (If CGH alteration - what expression ratio was found); bottom , altered expression used as 
independent variable fit expression alteration - what CGH deviation was found). 



CGH alterations 
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Expression change clusters 



Concordance 



CGH alterations 



Tumor 827 r vs. 532 
Expression change dusters 



Concordance 



13 Gain 



10 Loss 



10 Up-regulatlon 

0 Down-regulation 

3 No change 

1 Up-regulation 

5 Down-regulation 

4 No change 



77% 



50% 



10GaJn 
12 Loss 



8 Up-regulatlon 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 



B0% 
17% 



Expression change clusters 


Tumor 733 vs. 335 


Concordance 


Expression change clusters 


Tumor 4327 vs. 532 


Concordance 


CGH alterations 


CGH alterations 


16 Up-regulation 


11 Gain 


69% 


1 7 Up-regulatlon 


10 Gain 


59% 




2 Loss 






5 Loss 






3 No change 






2 No change 




21 Down- regulation 


1 Gain 


38% 


9 Down-regulation 


0 GaJn 


33% 




8 Losa 






3 Loss 
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two Invasive tumors (stage pT1 , TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-. and 7+, 9q-, 
and Y-, respectively. Both Invasive tumors showed changes 
(1q22-24+ ( 2q14.1<jtef- p 3q12-q13.3- ( 6q12-q22-, 
9q34 + , 11q12-q13+, : 17+, and 20q11 .2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown In Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
In the case of TCC 733 and 0.3-fold foe TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 In TCC 733 (Fig, 1/4) and 
20q12lnTCC827 (F1g. 1fl). 

mRNA Expression In Relation to DNA Copy Number- The 
mRNA levels from the two Invasive tumors (TCCs 827 and 
733) were compared with the two non-Invasive counterparts 
(TCCs 532 and 335). This was done In two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched In the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. t). 

For each mRNA a ratio was ca/culated between the level in 
the invasive versus tne non- invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as Informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the Independent variable 
and estimated the frequency of expression alterations In these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
Increased mRNA expression. For example, both chromo- 
somes 1q21-q25 t 2p and 9q, showed a relative gain of more 
than 100% In DNA copy number that was accompanied by 
Increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) {Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression In several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the Independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected tn areas with unaltered CGH ratios (Tabje I, bottom). 
Furthermore, as a control we looked at areas with r>o atter- 
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Expression changes 
d*t»ct»d 



Expression change* 
not detected 



Tumor 827 versus 532 



Expression change* 
detected 



Expression changes 
not detected 



Tunx)r 733 versus 335 



Fto. 2. Correlation between maximum CGH aberration and the obilHy to detsct expression change by oligonucleotide array 

monitoring. The aberration Is shown as a numerical -fokJ change In ratio between Invasive tumors 827 (A) and 733 ( ♦ ) and their non-Invasive 
counterparts 532 and 335. The expression change was taken from the Expression line to the right In Fig. 1 , which depicts the resulting 
expression change for a given chromosomal region. At least naif of the mRNAs from a given region have to be either up- or down-regulated 
to be scored as an expression change. All chromosomal arms tn which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were Included. 



atlon In expression. No alteration was detected by CGH In 
most of these areas (TCC 733, 60% and TCC 827, 81 %; see 
Table I,, bottom). Because the ability to observe reduced or 
Increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations In the 
regions showing CGH changes against the ability to detect a 
change In mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2)C£or both tumors TCC 733 (p < 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the ievel of CGH ratio change .(reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2^ Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion leveL which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatelitte-based Detection of Minor Areas of Loss- 
es—In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
cnromosome 1q32. 2p21 . and 7q21 and q32, 9q34, and 
I0q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two micros ate II It es positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
Indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p In 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript Increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1/4). As Indicated above, the mRNA decrease observed In 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomerlc end of chro- 
mosome 11 p showed a normal ratio In the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two mlcrosatellltes (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11p15.5, I2p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fk>. 3. Mlcrosatellite analyst of Iom of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu das* I histocompatibility antigen (gene 
number 38 in Fig. 1). tf>) by 01S2735 close to cathepsln E (gene 
number 41 In Fig. 1), and (c) at chromosome 2p23 by D2S2251 dose 
to generaJ ^spectrin (gene number 1 1 on Fig, 1) and of (d) tumor 827 
showing low of heterozygosity at chromosome 18q12byS18S111B 
close to mitochondrial 3-oxoacyf-coenzyme A thiotase (gene number 
12 in Fig. 1). The upper curves show the electrophorogram obtained 
from normal DNA from leukocytes (A/), and the lower curves show the 
etectropherogram from tumor DNA (7). In ail cases one allele is 
partially lost in the tumor ampllcon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at I8q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels— 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on ail four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fta 4. Correlation between protein level* ai judged by 20- 
PAGE and tranecrlpt ratio. For comparison proteins were divided In 
three groups, unaltered In teve* or up- or down-regulated Iporbonta/ 
axis). The mRNA ratio as determined by oilgonucJeotida arrays was 
plotted for each gene (vertical wtf$). A, mRNAs that were scored as 
present In both tumors used for the ratio calculation; A, mRNA* that 
were scored as abfeent In the Invasive tumors (along horizontal axis) or 
as absent In non-invasive reference (fop of figure). Two different 
scaJings were used to exclude scaling as a confoundw TCCs 827 
and 532 (AAJ were scaled with background suppression^ and TCCs 
733 and 335 (•O) were scafed without suppression. 3oth compari- 
sons showed highly significant (p < 0.0O5) differences in mRNA ratios 
between the groups. ProteMa shown were as follows; Groups (from 
tofl). phosphoglucomutase 1. glutathione transferase class M number 
4. fatty add-bindlng protein homologue, cytokeratin 15. and cyto- 
keratin 13; 8 (from loft), fatty acld-blndlng prolan homoiogue. 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; C (from fe/n <r-eno- 
lase. hnRNP B1. 28-kDa heat shock protein, 14-3-3-, and 
pre-mRNA splicing factor; D, mesothellai keratin K7 (type II); E (from 
top), glutathione S-transferase-7r and mesotheflai keratin K7(type II)- 
F(from fop and /eft), adenytyl cyclase-assoclated protein, E-cadherln' 
keratin 19, catglzrarln. phosphogfycerate mutase, annexin IV cv- 

lo^^T^ 01 ' 0 * hnRNP A1 ' int09raJ m ^brane protein cainexin 
(IP90). hnRNP H. brain-type clathrin light cha/n-a, hnRNP F 70-kOa 
heat shock protein, heterogeneous nuclear ribonucteoprot'e/n A/8 
translations^ controlled tumor protein, liver glyceraldehyde-3-phos^ 
Phate dehydrogenase, keratin a, aldehyde reductase and Na,K- 
ATPase 0-1 subunlt: G. (from fop and tefl), TCP20, caJgizzarin 70- 
kDa heat shock protein, cainexin, hnRNP H, cytokeratin 15 ATP 
synthase, keratin 19, trtosephosphate Isomerase, hnRNP F liver" giyc- 
eraidohyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-Tr, and keratin 8; H (from fen), plasma geisolin, autoantJgen cal- 
retjcuiin, thioredoxln, and NAD+ -dependent 15hydroxyprostagtandln 
dehydrogenase; / (from fop), prolyl 4-hydroxyiase 0-subunft, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1 triphos- 
phatase; J annexin II; K, annexin IV; L (f rom fop and left), 9f>kDa heat 
shock protein, prolyl 4-hydroxytase 0-subonH, a -«noJase GRP 78 
cyclophilin, and cofilln. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods {see "Expenmental 
Procedures"). In general there was a highry significant corre- 
late (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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Fto. 5. Comparison of protein end transcript leveta In Invaah/e 
and non-invastv« TCCa. The upper part of the figure a hows a 2D gel 
(left) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper gel highlight the area* that ore compared below. 
Identical areas of 20 gels of TCCa 532 and 827 are shown below. 
Clearly, cytokeratlns 13 and 15 are strongly down-regulated In TCC 
B27 (rod annotation). The tile on the array containing probes for 
cytokeratin 15 Is enlarged below the array (red arrow) from TCC 532 
and Is compared with TCC 827. The upper row of squares in each tile 
corresponds to perfect match probes; the tower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
$ pec) fie binding). Absence of signal la depicted as black, and the 
higher the signal the lighter the color. A high transcript level waa 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected In TCC 827 (absence of signals). For cytokeratin 13, a hfgh 
transcript level was also present In TCC 532 (15659 units), and a 
much tower level waa present In TCC 827 (623 units). The 20 gels at 
the bottom of the figure (terY) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (Invasive), respectively. Both proteins are 
down-regulated in the Invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
. tectod in the case of TCC 335 (1277 units) whereas very low levels 
were detected In TCC 733 (166 units). /£F, Isoelectric focusing. 



keratins encoded by genes on chr-omosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected In TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRM A changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed In the non-Invasive TCC 335 but lost In the Invasive 
counterpart (TCC 733; see Fig. 5). The smaJler number of 
proteins detected In both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes In the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratlns 17 and 20, annexlna II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes In chromosome 17q, a fre- 
quently amplified chromosomal area In Invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal 0NA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression In two pairs of non-Invasive 
and Invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear Individual regulation of the 
mRNA expression of single genes, which In some cases was 
superimposed by a ON A copy number effect in most cases, 
genes located In chromosomai areas with gains often exhib- 
ited Increased mRNA expression, whereas areas showing 
tosses showed either no change or a reduced mRNA expres- o 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-foJd change, 
thus being at the border of detection, in several cases, how- 



Table II 

Proteins whose expression leva! correlates with both mRNA and gene dose changes 



Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration* 


Protein alteration 


Anne Kin 11 


1q2t 


733 


Gain 


Aba to Pres* 


Increase 


Annexin IV 


2p13 


733 


Gain 


3,9-FokJ up 


Increase 


Cytokeratin 17 


I7q12-q21 


827 


Gain 


3.8-Fokj up 


* Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.6-FoW up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 


FBP1 


9q22 


827 


Gain 


2.3-Fokj up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-ql3 


827 


Lose 


2.5-FoW up 


Decrease 


Prohibits 


17q21 


827/733 


Gain 


3.7-/2.5-Fold up* 


Increase 


Proiyi-4-hydroxyl 


17q25 


827/733 


Gain 


5.7-/1.6- Fold up 


Increase 


hnRNPBI 


7p15 


827 


Loss 


2.5-FoW down 


Decrease 



• Abs, absent; Pres. present. 

6 In cases whore the corr «s ponding ail drat ions were lound in botn TCCs 827 and 733 these are shown as 827/733. 
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ever, an increase or decrease in DNA copy number was 
associated with do novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels In areas with DNA amplifications In the Inva- 
sive tumors (e.g. In TCC 733 transcript from cellular llgand of 
annexin li gene (chromosome 1q21) from absent to 2670 
arbitrary units; In TCC 827 transcript from small prollne-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an Increased expres- 
sion to a certain chromosomal area Indicates an Increased 
likelihood of gain of chromosomal material In this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie In the loss of controlled 
methylatlon in tumor cells (17-19). Thus, It may be possible 
that In chromosomes with Increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas In chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20. 21). A recent report has documented a 
pioidy regulation of gene expression in yeast but In this case ail 
the genes were present In the same ratio (22), a situation that Is 
not analogous to that of cancer ceils, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomeJ aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p- , 9q - , 1 q + , Y- 
(2, 6), and In pT1 tumors, 2q-,11p-, 11q-. 1q+ . 5p+, 8q + , 
17q + , and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minlmaJ Invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 11q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p- ( often 
linked to 20q + and 11 q13+ (both tumors) (7-9). These ob- 
servations Indicate that the pairs of tumors used In this study 
exhibit chromosomal changes observed In many tumors, and 
therefore the findings could be of general Importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it Is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re~ 
glons with Increased copy numbers. Analysis of these regions 
by positioning heterozygous rnicrosateilires as close as pos- 
sible to the locus showing reduced gene expression revealed 
foss of heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploldy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic Imprinting has 
an Impact on the expression level In normaJ cells and Is often 
reduced In tumors. However, the relation between Imprinting 
and gain of chromosomal material Is not known. 

We regard It as a strength of this Investigation that we were 
able to compare Invasive tumors to benign tumors rather than 
to normaJ urothellum, as the tumors studied were biologically 
very close and probabry may represent successive steps In 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and rfiRNA expression Is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This Indicate that 
bladder tumors are relatively homogenous, > a notion recently 
supported by CGH and LOH data that showed a rernarkabte 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, rnRNA and protein levels 
showed a striking correspondence although In some cases 
we found discrepancies that may be attributed to translations! 
regulation, post-transIationaJ processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
transiatlonalry Inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important In the case of polypeptides with a short 
haJHIfe (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D-PAGE (25), and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast 
^Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and Increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
'the ability to detect a change in transcript? One possible 
explanation could be that by losing one allele the change In 
mRNA level Is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase In gene copy number resulting In a much 
higher Impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off poini 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may In fhe future allow detection of lesser than 2-fold 
alterations In transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level, Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q, Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins Is at present unknown and will have to 
be proved by other methods usJng a larger number of sam- 
ples. One factor making such studies complicated Is the large 
extent of protein modification that occurs after translation, 
requiring Immunoidentlficatlon and/or mass spectrometry to 
correctly Identify the proteins In the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather In 
the future by combining state of the an techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on mlcroarrays wfth many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain Information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming, 
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ABSTRACT 

Genetic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed tbe use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations In cancer, bat very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on 
cDNA microarrays In breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent ampJlcons, ranging In size from 0 J to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes bad a substantial Impact on gene expression, with 44% of the 
highly amplified genes showing over/expression and 103% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes In breast cancer and 
many novel targets for genomic alterations, including the WOXfl7,gene, 
the presence of which in a novel amplicon at 17q21 3 was validated in 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overexpression Is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 
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Copy number ratio 




Expression ratio 

Fig. 1 . Impact of gene copy number on global gene expression levels, A. percentage of 
over- and underaiprcsscd genes (Y axis) according to copy number ratios (X axis). 
Threshold values used for over- and undcrexpressioQ were >2.J84 (global upper 7H of 
the cDNA ratios) and <0.4S26 (global lower 7% of the expression ratios). B, percentage 
of amplified and deleted genes according to expression ratios. Threshold values lor 
amplification and deletion were >\J5 and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (6) identify and characterize those genes whose mRLNA expres- 



3 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluo- 
rescence in situ hybridization, RT-PCR. reverse transcription-PC R. 
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Fig. 2. Gcnotnc-wfde copy number and expression analysis in the MCF-7 breast cancer cell line. A. chromosomal CGH analysis of MCF-7. The copy number ratio profile {blue 
line) across the entire genome from Ip telomere to Xq telomere is shown along with ± 1 SO {orange lints). The black horizontal tint indicates a ratio of 1 .0; red tin*, a ratio of 0.8; 
and green tine, a ratio of 1.2. 0-C genome-wide copy number analysis in MCF-7 by CGH on cDNA microamy. The copy number ratios were plotted as a function of the position 
of the cDNA clones along the human genome. In b\ individual data points arc connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the 
copy number ratio of 1.0. In C individual data points are labeled by color coding according to cDNA expression ratios. The bright red don indicate the upper 2%, and dark red dots. 
the next 5% of the expression ratios in MCF-7 cells (overexpressed genes); bright green dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios 
(underexpressed genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure, and chromosome boundaries are 
indicated with a dashed line. 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND N^ETHODS 

Breast Cancer Cell Unes. Fourteen breast cancer cell lines (BT-20, BT- 
474. HCC1428, Hs578t, MCF7. MDA-361, MDA-436. MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols, 

Copy Number and Expression Analyses by cDNA Micro arrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (U-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 /ig of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with Alu\ and Rsa\ (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
y% of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with CyS-dUTP (Amersham Pharmacia) using 
the Bioprimc Labeling kit (Life Technologies, Inc.). Hybridization ( 14, 15) and 
posthybridizaiion washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagcne, 
La Jolla, CA) was used in all experiments. Forty /xg of reference RNA were 
labeled with Cy3-dUTP and 3.5 jig of test mRNA with Cy5-dUTP, and the 
labeled cDNAs were hybridized on microarrays as described (13, 15). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DE ARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basts of 88 housekeeping genes, which were spotted four times onto the 
array. Low quality measurements {i.e., copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and/or with spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define cutpoints for increased/ 
decreased copy number. Genes with CGH ratio > 1 .43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Microarray Data* To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, > 1.43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (1). We calculated a weight, w r for each gene as follows: 

m,, - m^o 
w = ___________ 

• + 

where m gU cr #1 and denote the means and SDs for the expression 

levels for amplified and nonamplified cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amplicon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141. 6 A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



* Internet address: http;//rc3carchnhgri.nih.gov/nucrc*^ 
7 Internet address: www.gcnomc.ucsc.edu. 
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Table I Summery of independent amplicons fit 14 breast cancer cell lines by 
CGH microarray 



Location 



lp!3 
Jq21 
Iq22 
3pl4 

7pl2.l-7p1IJ 

7q31 

7q32 

8q21.ll-4q2t.I3 
8<*2U 

8q23.3-«c.24.l4 

8q24.22 

9pl3 

I3q22-q3! 

16q22 

I7qll 

!7ql2-q2l.2 

I7q2l.32-q21.33 

I7q22-q233 
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CGH were validated, with lq21, 17ql2.— q2l.2 f 17q22-q23, 20ql3.1, 
and 20ql3.2 regions being most comnnonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39.25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lp!3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl l-p!2 (Fig. 1A). In BT-474, the two known amplicons 
at 17ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 35). In addition* several genes, including the 
homeobox genes HOXB2 and HOXB 7, were highly amplified in a 
previously undescribed independent amplicon at 17q21.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3£, inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, andZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplificd clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP1 1-361KS was la- 
beled with SpcctnimOrange (Vysia, Downen Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained from Vysis. SpcctrumGrecn- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formal to-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyse* as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NIH. Specimens containing a 2-fold or higher 
increase in the number of lest probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the tog-rank test 

RT-PCR. The HOXB 7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, Wl) with 10 ng of mRNA 
as a template. HOXB7 primers were 5'-GAGCAGAGGGACTCGGACTT-3* 
and 5'-GCOTCAGGTAGCGATTGTAG-3'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH raicroarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (/.e.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. I A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. IB). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 11 ,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2 f Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH microarray profiles. A, genes in the 
7pl l-p!2 amplicon in the MDA-468 cell line ire highly expressed (red dots) and include 
the EGFR oncogene. B, several geaci in the 17q]2, I7q21.3, and 17q23 amplicons in the 
BT-474 breast cancer cell Una arc highly overexpressed (red) and include the HOXB 7 
genc. The data labels and color coding arc as indicated for Fig, 2C. Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (green) to MDA-468 {A) and //0AB7-specific probe (red) and chro- 
mosome 17 centromere (green) to BT-474 cclU (B). 
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Pig. 4. List of 50 genes with *w*nczl\y 
significant correlation (o value <0.O5) between 
gene copy number and gene expression. Name 
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gene are indicated. The gene* have been ordered 
accenting to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell hnes. The 
key to the color code is shown at the bottom of the 
graph. Gmyujucres. missing vslues. The complete 
list of 270 genes is shown in supplemental Fig. B. 




amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 15 J 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that'could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



* Internet address: hup:/; www.gcneontology.ony. 
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level copy number Increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the bomeobox gene region at I7q21.3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, MYC, 
EGFR, ribosomal protein s6 kinase, and AIB3 t but also numerous 
novel genes such as NRAS-related gene (lpl3), syndecan-2 (8q22) t 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer, and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q2l.3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 
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Genomic DNA copy number alterations are key genetic events In 
the development and progression of human cancers. Here we 
report a genome-wkle microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation in 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of ampllcon boundaries and 
the quantitative analysis of ampllcon shape provide significant 
improvement in the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation In gene copy number contributes to 
variation in gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number Influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number Is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors Is directly attributable to underlying variation In gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
(e.g„ FGFR1 (8pll), MYC (8q24), CCND1 (llql3), ERBB2 
(17ql2), and ZNF217 (20ql3)j and tumor suppressor genes 
{RBI (13ql4) and TP53 (I7pl3)], the relevant gene(s) within 
other regions (e.g., gain of lq, 8q22, and 17q22-24, and loss of 
8p) remain to be identified. A high-resolution genome-wide 
map. delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we bad measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays, we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8 f 9), and 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site, vzww.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et al (7), with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. 'Test" DNA 
(from tumors and cell lines) was f luorescently labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and bybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of Teal DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 
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identifying the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UmGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genome.ucsc.edu/; Oct 7, 2000 Freeze). For UmGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UmGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.e^ reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig 1«; also sec Materials and Methods for details of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the f Golden 
Path" (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 
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deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. 16), as we did before 
m demonstrated the sensitivity of our method to detect single- 
copy loss (45, XO), and 13- (47,XXX). 2, (46\XXXX), or 
23-fold (49.XXXXX) gains (also see Fig, 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example, gains within lq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer cell lines/tumors 
(90%/69%, 10Q%/47%, 100%/60%, and 90%/44%, respective- 
IvK as were losses within lp, 3p, 8p, and 13q (80%/24%, 
80%/22%, 80%/22%, and 70%/18%, respectively), consistent 
with published cytogenetic studies (refe. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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of X chromosomes, for breast cancer cell lines, and for breast tumors. Breast cancer cell lines and tumor, "V^*^ 
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chromosome. Fluorescence ratios (test/reference) are depicted by a log, pseudocolor scale (indicated). Selected genes are pleated with colofH^ded Mtext fredt 
t^S^Ji decreased; bUckl ™ W gray, not well mea^ 

of samples displaying the DMA copy number change). The map positions for genes of interest that are not represented on the microarray are indicated buhe 
To^^T^n^J^ on the array (b) Graphical display of ONA copy number profile for breast cancer cell line SKBR3. Fluorescence ratios 
(tumor/normal) are plotted on a logs scale for chromosome B genes, ordered along the chromosome. 



number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors that were high grade (P - 
0.008), consistent with published CGH data (3). estrogen recep- 
tor negative (P - 0.04), and harboring TP53 mutations (P « 
0.0006) (see Table 4* which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
unchaxacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Ft£ 2fc). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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artLt^or breast cancer cell UnS and tumors. Breast cancer cell lines and tumon are separate* orders 

and mRNA leveb were determined, are ordered by position along the chromosome; selected genes are indkated In cok>r<oded text (see Fig. 2 legend). 
Fluorescence ratios (test/reference) are depicted by separate iog a pseudocolor scales (indicated). 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4<?). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
r tests comparing adjacent classes: cell lines, 4 x 10"" 49 , 1 x 10* 49 , 
5 x V',1 X 10-4 tumors, 1 X 10"« 1 X 10-*» 5 x 10- 1 . 
1 x 10~ 4 ). A linear regression of the average k>g(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a Mold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
Aa f regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4fr). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 46) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Ftg. 4. Genome^We influence of DNA copy number alterations on mRNA levels, (a) For breast cancer cell lines (gray) end ^inorsi^es (btock). both 

lndk«te standard error, of the mean) are plotted for each of five classes of genes, representing DMA deletion (turrKK/rwrmalmte < ^ *»*^*»^ 
low (1 2-2) medium- (2-4). and high-level (>4) amplification. P values for pair-wfce Student's t tests, comparing averages between adjacent dasses (moving 
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The expected values were obtained by ranoomization of the sample labels in the DNA copy number data set. The l^of unrty * mdkate£(oO Percent variance 
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(gray line) are plotted for different fluorescence intensity/background (a rough surrogate for ^njjl/nobe) cutoff v »^^^^^^^J^ b ^ 
to the 1.2 Intensity/background cutoff. Details of the linear regression model used to estimate the fraction of variation In gene expression /^^We to 
underlying DNA copy number alteration can be found in the supporting information (see Estimating the Fraction of Variation in Gene Expression Attributable 
to Underlying DNA Copy Number Alteration). 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. Ad). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4d). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana* 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be gencralizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). ~~ 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et aL (15) recently reported that in metastatic 
colon tumors only —4% of genes within amplified regions were 
found more highly (>2-fb)d) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et al. (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcallmg re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoreguiation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gene(s) 
within amplicons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension/ the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the anrplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochioroetric relationships in cell metabolism and physiology 
(eg., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 

We thank the many member* of the P.O.B. and D.B. labs for helpful 
discussions. J.R.P. was a Howard Hughes Medical Institute Physician 
Postdoctoral Fellow during a portion of this work. P.O.B. is a Howard 
Hughes Medical Institute Associate Investigator. This work was 
supported by grants from the National Institutes of Health, the Howard 
Hughes Medical Institute, the Norwegian Cancer Society, and the 
Norwegian Research Council 



T„ Eiaen, M. £U van de Rijn, M„ Jeffrey, S.S^ etaL (2001) Pwc Nad. Acad. 
ScL USA 98, 10869-10674. 

10. Schuler, G. D. (1997) /. Mot Med. 75, 694-698. 

11. Under. E. Unton. U M„ Btrren, B., Nufbaum. C Zody, M. C, Baldwin, 
J„ Devon, 1L, Dewar, K., Doyte, M, FtaHugh, W, ct aL (2001) Nature 
(London) 409, 860-921. 

12. Fejzo, M. Godfrey, T„ Chen. G, Waldman, F. & Gray, J. W. (1998) Genes 
Chromosomes Cancer 22, 105-113. 

13. Hughes, T- R-, Roberts, C J, Dai, H., Jones, A. R„ Meyer, M. R^ Slade, D„ 
Burcbard, K Dow, Ward, T. R-, Kidd, M. J„ ct at (2000) Nau Genet 25, 
333-337. 

14. Phillips, J. U Hayward, S. W„ Wang, Y-, VaiKJU, J, Favkxrfch, G. PadiBa- 
Nash, H., Peznllo, J. R-, Ghadimi, B. \L, Grossfcld, G. D^ Rivera, K+etal. 
(2001) Cancer Res. 61, 8145-8149. 

15. Platrcr, Upender, M. B M Wilson, WOHi, J, Lotterbaugh, J^ Nosrarl, 
Willson, J. IC Mack, D., Ried. T. & Markowhz, S. (2002) Cancer Bex. 62, 
1134-1138. 

16. Albertson. D. G.. Ybtra. B., St grave*, R~, CoUini, C, Dairkee, S. Kowbel, 

Kuo, W. U, Gray, J. W. A Pinkel, D. (2000) Mil. G«irt. 25, 
144-146. 

17. U, R- Yerganian, Ductberg. Kraemcr, Wilier, A, Rautch, C & 
HcWmann. R. (1997) Proc Natl Acad. Set. USA 94, 14506-14511. 

18. Rasnick, D. & Duesbcrg, P. H. (1999) Biochem. J. 348, 621-630. 



12968 | wvww.pnas.ofgyc9i/doVl0.1073/pna».t 62471999 



Pollack et*i 



■Ml TECHNICAL UPDATE 

FROM YOUR LABORATORY SERVICES PROVIDER 

HER-2/neu Breast Cancer Predictive Testing 

Julie SanfordHanna, Ph.D. and Dan Mornin, A/.D. 



Each year, over 182,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(IHC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, PathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-amplifica- 
tion. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (£ 1 cm) tumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin 0 (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin 0 in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin 0 in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMCVPAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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CPT code information 

HER-2/aeu via IHC 

88342 (including interpretive report) 

HER-27neu via FISH 

88271 x2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
8829 1 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 



Procedural Information 

Immunohistochernistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest<>. The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure forroutinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site, the specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 17 centromere and the second for the HER- 
2/neu oncogene located at 17ql 1 .2-12 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
homologues normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 
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High-throughput technologies, such as proteomic screening and DNA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships in Medline. Using MedGene, we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant* correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 

Keywords: bioinformatics • micro-araiy • text mining • gene-disease association • breast cancer 



Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields. let alone cover a broader 
range of topics. For example, to follow a single disease, c.g„ 
breast cancer, a researcher would have had to scan 130 different, 
journals and read 27 papers per day in 1999. 1 This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteomics, which require the analysis of 
large datasets Involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem Is to compile structured digital 
resources, such as the Breast Cancer Gene Database 1 and the 
Tumor Gene Database. 2 However, as these resources are hand- 
curated, the labor-intensive review process becomes a rate- 
limiting step in the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which Involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
die literature or by functional annotation. 3 * 7 Thus far, few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez- Iratxeta et al. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to identify genes linked to inherited disorders. 8 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to incorporate information regarding 
all possible gene-disease relationships, including biochemical, 
physiological, pharmacological epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. vSecond. it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach lo assess the strength of 
each association based on the frequency of citation and co- 
citation. We applied this tool to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedGene is a relational database, stor- 
ing disease and gene information from NCBI, text mining re- 
sults. statisticaJ scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user interface for users to 
query the database (http://riipseq.med.harvard.e<iu/MetlC<me/ / ). 

Text Mining Algorithms. MeSH Tiles were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) (http:// 
www.nlm.niri.gov/mesh/meshhoine.hUnl) and human disease 
categories were selected. LocusLink files were downloaded from 
the LocusIJnk web site at NCBI (http://www.ncbi.nih.gov/ 
LocusLink/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July, 2002) was pre-selected. 

A JAVA module examined the MeSH terms and men indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On me basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relatioaships and evaluated the results. These methods 
Included chi-square analysis, Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of -disease 16 (http:// 
hipseq.med.haivard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysts with the micro-array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using "TreeViewer" (http:// 
rana.lbl.gov/EisenSoftware.htm). 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
types of breast cancer. Biotinylated cRNA, generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Affymetrix U95A oligo-nucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Affymetrix. and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To. study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 9 10 Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information, Supplemental Table 1. or visit http://hipseq. 
med.harvard.edu/MedGene/publication/s_Table l.html) were 
selected from the 2002 MeSH Index creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list, of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLink database at 
NCBI."-" which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18 th . 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, If at all. 
For the Intended use of this study, this lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table 1). False positives primarily arose when the searched 
temi had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of Glters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by . duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case-insensitive string mapping was used for all 
searches except av noted above. No additional weight, was 
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papers) wen; reviewed manually (see the; Supporting Informa- 
tion, Supplemental Table 2, or visit http://hipseq.med. 
harvard.edu/MedGene/publication/s_TabIe 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made . 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information, Supplemental Table 3. or visit 
http://hipseq.med.harvard.edu/MedGene/publication/ 
s_Table 3.html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores, in both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer. 13,14 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
information, Supplemental Table 4, or visit http://hipse- 
q.med.harvard.edu/MedGene/publication/s_Table 4.html), em- 
phasizing the importance of the filters that were added in the 
search algorithm (Table I). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESR1 and 10 to ESR2, whereas almost 2000 papers 
mention**! estrogen receptor without specifying FSRI or ESR£ 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute tills experiment, the gene sets and the corre- 
sponding LPF values for 1 000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information, 
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Supplemental Figure I, or visit http://hipseq.med.harvard.edu/ 
MedGene /publication/s_J , 'igure lJitml). For example, in one 
such cluster shown in Figure 2, diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by the false 
negative rate (~9%) and down by the false positive rate (—26% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean ± s,d.) genes associated with it, although the range is 
quite broad with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high- throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes that had greater than a I -fold difference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene. we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (first- 
degree association by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedGene/ 
publication/s_Figure 2.html.) Among the 505 previously un- 
related genes, 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
an*ay analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF). 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by first-degree associa- 
tion. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r = 0.018, p- value = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had 10-fold change or more in 
expression level, the correlation increased to 0.41 (/rvalue = 
0.05). 

When we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status, interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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Figure 2. Global validation by clustering analysis. 2(A). The gene sets and the corresponding LPF values for 1000 diseases, each with 
at least 50 gene relationships, were used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data is shown here. 2(B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
terms (above the line) and starvation states terms (under the line) clustered together. Within these groups, there is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders, etc. (Supplemental Figure 1: http://hipseq.med. 
harvard.edu/MedGene/publication/s_Figure 1 .html). 



Finally, to validate our findings, we computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by MedGene for hypertension, a 



disease unrelated to breast cancer. As expected, we did not 
observe an increasing trend in correlation for hyperten- 
sion. 
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Figure 3. Relationship between literature score and functional data for breast cancer. 3 A. The data from an expression analysis of 
samples for breast tumors and normal breast tissue were analyzed to indicate the fold difference of expression level between breast 
tumor and normal sample (cutoff > 3 -fold change). The fold changes were plotted against the literature score for the same gene set. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search and red 
dots represent no-association. Some well-studied genes, such as BRCA2 (pink circle), are not reflected by a substantial difference in 
expression level. Furthermore, the majority of genes that have no association with breast cancer in the literature had less than 10-fold 
expression changes (shaded area). 3B. The Spearman rank-correlation coefficients between literature score (LPF) and the fold change 
of expression level between tumor and normal breast samples (y-axis) in relation to the amount of fold change of expression level 
(x-axis). Gene rank lists were generated for breast cancer (blue) and hypertension (pink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among estrogen receptor positive tumors only (light blue) and 
estrogen receptor negative tumors only (purple). 
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» MedGene results for the top 25 genes dissociated with breast neoplasms, hypertension, rheumatoid arthritis, bipolar disorder, and atherosclerosis, respectively, 
ranked by LPF scores. The hyperlink to all the papers iio-dting the gene and the disease is available at MedGene website flittp://hipseq.mecLharv3rd.edu/ 
Med Gene/). 



Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organization 
and biological systems. High -throughput technologies can 
provide novel insight into comprehensive biological function 
but also Introduces new challenges. The utility of these 
technologies is limited to the ability to generate, analyze, and 
interpret, large gene lists. MedGene, a relational database 
derived by mining the information in Medline, was created to 
address this need MedGene users can query for a rank -ordered 
list of human gene-disease relationships (Table 2) for one or 
more diseases. Each entry is hyperlinked to the original papers 
supporting each association and to other relevant databases. 

MedGene is an innovative extension of previous text mining 
approaches. Perez-lratxeta et a), used the GO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders. 8 MedGene takes a broader view 
and includes all diseases and all possible gene-disease relation- 
ships. Furthermore, MedGene utilizes co-citation to indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have GO annotation. Our approach Is 
complementary to that taken by Chaussabel and Sher, who 
used the frequency of co-cited terms to cluster genes into a 
hierarchy of gene-gene relationships. 6 

A unique aspect of this tool is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias 15 and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
http://hipseq.med. harvard.edu/MedGene /publication/sta- 
ble 3.html). Of course, relationships established by frequency 
of co-citation do not necessarily represent a u*ue biological link; 
however, it is strong evidence to support a true relationship. 

Another important feature of MedGene is the implementa- 
tion of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study, all of the Miters that we applied were general 
ones, e.g., expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
gene names that correspond to common English words, etc. 
Trie majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce trie false positive 
rate. 

It is not uncommon to see expression changes in micro- 
array experiments as small as 2 -fold reported in the literature. 
Even when these expression changes are statistically significant, 
it is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedGene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-fold change or less in tumors compared 
to'normal, there was no evidence of a correlation between 
altered gene expression and a known role In the disease. This 
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Table 3. Genes with Large Expression Changes in ER- but 
Not in ER+ Breast Tumors 
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EB12 


4.0 


27.9 


GZKfB 


3.8 


21.9 


STK18 


4.7 


18.6 


GPR49 


1.0 


14.6 


MYO10 


1.6 


14.4 


I AO J 


-1.0 


13.5 


POIJE2 


4.2 


13.0 


HMG4 


4.4 


12.9 




—1.2 


12.3 


LRP8 


2.9 


12.2 


CCNB2 


1.0 


1 1.8 


CCNE2 


4.0 


11.6 


VCR 


—4.3 


1 1.1 


KIMS I ft 


2.9 


10.9 


ni r.7 


3.0 


10.2 


SERPINH2 


4.6 


10.2 


YAPl 


1.0 


10.0 


LPHB 


- 1.3 


-10.4 


TCEA2 


-1.1 


-10.8 


TFFJ 


1.3 


-11.4 


COU7A1 


-4.1 


-15.7 


POPS 


1.1 


-16.2 


BPAG1 


-4.6 


-22.3 


PDZKI 


-1.1 


-36.8 


VEGFC 


-2.8 


-51.5 


MUC6 


-1.4 


-64.9 


SERPJNA5 


-1.0 


-83.1 


MEISl 


-1.6 


-85.9 


CA12 


2.4 


-150.3 



Table 3. MedGene identified a set of relatively understudied, yet highly 
expressed genes in ER negative, but not ER positive breast tumors. All of 
thtise yunes havu tnther nevwr betm co-citiid with breast cancer or have a 
weak association except those marked with an *. 



reflects the many genes whose role in breast cancer may not 
involve large changes in expression in sporadic tumors (e.g., 
BRCA1 and BRCA2) and genes whose modest changes in 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more In expression level, there 
was a strong and significant correlation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
First, a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
loss than 5-fold expression level changes. Second, any genes 
with 10- fold changes or more are Hkely to be related to breast 
cancer and warrant attention. It is likely mat this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positive tumors, not ER-negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments that may contain 
subpopulations that behave very differently. The MedGene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER-negative tumors that are worthy of 
further examination (Table 3). 
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In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore, it can 
be used for selecting subsets of genes for functional studies, 
Tot building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high- throughput 
applications. In the future, it will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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Aneuploidy and cancer 

Subrata Sen, PhD 



Numeric aberrations in chromosomes, referred to as aneu- 
ploidy, is commonly observed in human cancer. Whether aneu- 
ploidy is a cause or consequence of cancer has long been 
debated. Three lines of evidence now make a compelling case 
for aneuploidy being a discrete chromosome mutation event 
that contributes to malignant transformation and progression 
process. First, precise assay of chromosome aneuploidy in 
several primary tumors with in situ hybridization and compara- 
tive genomic hybridization techniques have revealed that 
specific chromosome aneusomies correlate with distinct tumor 
phenotypes. Second, aneuploid tumor cell lines and in vitro 
transformed rodent celts have been reported to display an 
elevated rate of chromosome instability, thereby indicating that 
aneuploidy is a dynamic chromosome mutation event associ- 
ated with transformation of cells. Third, and most important, a 
number of mitotic genes regulating chromosome segregation 
have been found mutated in human cancer cells, implicating 
such mutations in induction of aneuploidy in tumors. Some of 
these gene mutations, possibly allowing unequal segregations 
of chromosomes, also cause tumorigenic transformation of 
cells in vitro. In this review, the recent publications investigat- 
ing aneuploidy in human cancers, rate of chromosome instabil- 
ity in aneuploidy tumor celts, and genes implicated in regulat- 
ing chromosome segregation found mutated in cancer cells 

are discussed. Curr Opin Oncol 2000, 12:82-88 © 2000 Lippincott Williams 
& Wilkins, Inc. 
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Cancer research over the past decade has firmly estab- 
lished that malignant ceils accumulate a large number of 
genetic mutations that affect differentiation, prolifera- 
tion, and cell death processes. In addition, it is also 
recognized that most cancers arc clonal, although they 
display extensive heterogeneity with respect to kary- 
otypes and phenotypes of individual clonal populations. 
It is estimated that numeric chromosomal imbalance, 
referred to as aneuploidy t is the most prevalent genetic 
change recorded among over 20,000 solid tumors 
analyzed thus far (1]. Phenotypic diversity of the clonal 
populations in individual tumors involve differences in 
morphology, proliferative properties, antigen expression, 
drug sensitivity, and metastatic potentials. It has been 
proposed that an underlying acquired genetic instability 
is responsible for the multiple mutations detected in 
cancer cells that lead to tumor heterogeneity and 
progression [2]. In a somewhat contradictory argument, 
it has also been suggested that clonal expansion due to 
selection of cells undergoing normal rates of mutation 
can explain malignant transformation and progression 
process in humans [3]. Acquired genetic instability, 
nonetheless, is considered important for more rapid 
progression of the disease [4*»]. Although the original 
hypothesis on genetic instability in cancer primarily 
focused on chromosome imbalances in the form of aneu- 
ploidy in tumor cells, the actual relevance of such muta- 
tions in cancer remains a controversial issue. 

Whether or not aneuploidy contributes to the malignant 
transformation and progression process has long been 
debated. A prevalent idea on genetics of cancer referred 
to as "somatic gene mutation hypothesis" contends that 
gene mutations at the nucleotide level alone can cause 
cancer by cither activating cellular proto-oncogencs to 
dominant cancer causing oncogenes and/or by inactivat- 
ing growth inhibitory tumor suppressor genes. In this 
scheme of things chromosomal instability in the form of 
aneuploidy is a mere consequence rather than a cause of 
malignant transformation and progression process. 

In this review, some of the recent observations on the 
subject are discussed and compelling evidence is 
provided to suggest that aneuploidy is a distinct form of 
genetic instability in cancer that frequently correlates 
with specific phenotypes and stages of the disease. 
Furthermore, discrete genetic targets affecting chromo- 
somal stability in cancer cells, recently identified, are 
also discussed. These data provide a new direction 
toward elucidating the molecular mechanisms responsi- 



ble for induction of aneuploidy in cancer and may even- 
tually be exploited as novel therapeutic targets in the 
future. 

Genetic alterations in cancer 

Alterations in many genetic loci regulating growth, 
senescence, and apoptosis, identified in tumor cells, 
have led to the current understanding of cancer as a 
genetic disease. The genetic changes identified in 
tumors include: subtle mutations in genes at the 
nucleotide level; chromosomal translocations-leading to 
structural rearrangements in genes; and numeric 
changes in either partial segments of chromosomes or 
whole chromosomes (aneuploidy) causing imbalance in 
gene dosage. 

For the purpose of this review, both segmental and whole 
chromosome imbalances leading to altered DNA dosage 
in cancer cells are included as examples of aneuploidy. 

Incidence of aneuploidy in cancer 

Evidence of aneuploidy involving one or more chromo- 
somes have been commonly reported in human tumors. 
Although these observations were initially made using 
classic cytogenetic techniques late in a tumor's evolu- 
tion and were difficult to correlate with cancer progres- 
sion, more recent studies have reported association of. 
specific nonrandom chromosome aneuploidy with 
different biologic properties such as loss of hormone 
dependence and metastatic potential [5]. 

Classic cytogenetic studies performed on tumor cells 
had serious limitations in scope because they were 
applicable only to those cases in which mitotic chromo- 
somes could be obtained. Because of low spontaneous 
rates of cell division in primary tumors, analyses 
depended on cells either derived selectively from 
advanced metastases or those grown in vitro for variable 
periods of time. In both instances, metaphases analyzed 
represented only a subset of primary tumor cell popula- 
tion. Two major advances in cytogenetic analytic tech- 
niques, in situ hybridization (ISH) and comparative 
genomic hybridization (CGH), have allowed better reso- 
lution of chromosomal aberrations in freshly isolated 
tumor cells [6], ISH analyses with chromosome-specific 
DNA probes, a powerful adjunct to metaphasic analysis, 
allows assessment of chromosomal anomalies within 
tumor cell populations in the contexts of whole nuclear 
architecture and tissue organization. CGH allows 
genome wide screening of chromosomal anomalies 
without the use of specific probes even in the absence 
of prior knowledge of chromosomes involved. Although 
both techniques have certain limitations in terms of 
their resolution power, they nonetheless provide a 
better approximation of chromosomal changes occurring 
among tumors of various histology, grade, and stage 
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compared with what was possible with the classic cyto- 
genetic techniques. Genomic ploidy measurements 
have also been performed at the DNA level with flow 
cytometry and cytofluorometric methods. Although 
these assays underestimate chromosome ploidy due to a 
chromosomal gain occasionally masking a chromosomal 
lo&s in the same cell, several studies using these 
methods have supported the conclusion that DNA 
aneuploidy closely associates with poor prognosis in 
various cancers [7,8]. This discussion of some recent 
examples published on aneuploidy in cancer includes 
discussion of studies dealing with DNA ploidy measure- 
ments as well. Most of these observations are correlative 
without direct proof of specific involvement of genes on 
the respective chromosomes. Identification of putative 
oncogenes and tumor suppressor genes on gained and 
lost chromosomes in aneuploid tumors, however, are 
providing strong evidence that chromosomes involved in 
aneuploidy play a critical role in the tumorigenic 
process. . ; ■ ' 

In renal tumors, either segmental or whole chromosome 
aneuploidy appears to be uniquely associated with 
specific histologic subtypes [9]. Tumors from patients 
with hereditary papillary renal carcinomas (HPRC) 
commonly show trisomy of chromosome 7, when 
analyzed by CGH. Germline mutations of a putative 
oncogene MET have been detected in patients with 
HPRC. A recent study [10] has demonstrated that an 
extra copy of chromosome 7 results in nonrandom dupli- 
cation of the mutant MET allele in HPRC, thereby 
implicating this trisomy in tumorigenesis. The study 
suggested that mutation of MET may render the cells 
more susceptible to errors in chromosome replication, 
and that clonal expansion of cells harboring duplicated 
chromosome 7 reflects their proliferative advantage. In 
addition to chromosome 7, trisomy of chromosome 17 in 
papillary tumors and also of chromosome 8 in mesoblas- 
tic nephroma are commonly seen. Association of specific 
chromosome imbalances with benign and malignant 
forms of papillary renal tumors, therefore, not only 
contribute to an understanding of tumor origins and 
evolution, but also implicate aneuploidy of the respec- 
tive chromosomes in the tumorigenic transformation 
process. 

In colorectal tumors, chromosome aneuploidy is a 
common occurrence. In fact, molecular allelotyping 
studies have suggested that limited karyotyping data 
available from these tumors actually underestimate the 
true extent of these changes. Losses of heterozygosity 
reflecting loss of the maternal or paternal allele in 
tumors are widespread and often accompanied by a gain 
of the opposite allele. Therefore, for example, a tumor 
could lose a maternal chromosome while duplicating 
the same paternal chromosome, leaving the tumor cell 



84 Cancer biology 



with a normal karyotype and ploidy but an aberrant 
allelotype. It has been estimated that cancer of the 
colon, breast, pancreas, or prostate may lose an average 
of 25% of its alleles. It is not unusual to discover that a 
tumor has lost over half of its alleles [4]. In clinical 
settings, DNA ploidy measurements have revealed that 
DNA aneuploidy indicates high risk of developing 
severe premalignant changes in patients with ulcerative 
colitis, who are known to have an increased risk of 
developing colorectal cancer [11], DNA aneuploidy has 
been found to be one of the useful indicators of lymph 
node metastasis in patients with gastric carcinoma and 
associated with poor outcome compared with diploid 
cases [12,13]. CGH analyses of chromosome aneu- 
ploidy; on the other hand, was reported to correlate gain 
of chromosome 20q with high tumor S phase fractions 
and loss of 4q with low tumor apoptotic indices [14]. 
Aneuploidy of chromosome 4 in metastatic colorectal 
cancer has recently been confirmed in studies that used 
unbiased DNA fingerprinting with arbitrarily primed 
polymerase chain reactions to detect moderate gains 
and losses of specific chromosomal DNA sequences 
[15], The molecular karyotype (amplotype) generated 
from colorectal cancer revealed that moderate gains of 
sequences from chromosomes 8 and 13 occurred in 
most tumors, suggesting that overrepresentation of 
these chromosomal regions is a critical step for metasta- 
tic colorectal cancer. 

In addition to being implicated in tumorigenesis and 
correlated with distinct tumor phenotypes, chromosome 
aneuploidy has been used as a marker of risk assessment 
and prognosis in several other cancers. The potential 
value of aneuploidy as a noninvasive tool to identify 
individuals at high risk of developing head and neck 
cancer appears especially promising. Interphase fluores- 
cence in situ hybridization (FISH) revealed extensive 
aneuploidy in tumors from patients with head and neck 
squamous cell carcinomas (HNSCC) and also in clini- 
cally normal distant oral regions from the same individu- 
als [16,17]. It has been proposed that a panel of chromo- 
some probes in FISH analyses may serve as an 
important tool to detect subclinical tumorigenesis and 
for diagnosis of residual disease. The presence of aneu- 
ploid or tetraploid populations is seen in 90% to 95% of 
esophageal adenocarcinomas, and when seen in 
conjunction with Barrett's esophagus, a premalignant 
condition, predicts progression of disease [18,19], 
Chromosome ploidy analyses in conjunction with loss of 
heterozygosity and gene mutation studies in Barrett's 
esophagus reflect evolution of neoplastic cell lineages in 
vivo [20]. Evolution of neoplastic progeny from Barrett's 
esophagus following somatic genetic mutations 
frequently involves bifurcations and loss of heterozygos- 
ity at several chromosomal loci leading to aneuploidy 
and cancer. Accordingly, it is hypothesized that during 



tumor cell evolution diploid cell progenitors with 
somatic genetic abnormalities undergo expansion with 
acquired genetic instability. Such instability, often 
manifested in the form of increased incidence of aneu- 
ploidy, enters a phase of clonal evolution beginning in 
premalignant cells that proceeds over a period of time 
and occasionally leads to malignant transformation. The 
clonal evolution continues even after the emergence of 
cancer. 

The significance of DNA and chromosome aneu- 
ploidy in other human cancers continue to be evalu- 
ated. Among papillary thyroid carcinomas, aneuploid 
DNA content in tumor cells was reported to correlate 
with distant metastases, reflecting worsened progno- 
sis [21]. Genome wide screening of follicular thyroid 
tumors by CGH, on the other hand, revealed frequent 
loss of chromosome 22 in widely invasive follicular 
carcinomas [22]. Chromosome copy number gains in 
invasive neoplasm compared with foci of ductal carci- 
noma in situ (DCIS) with similar histology have been 
proposed to indicate involvement of aneuploidy in 
progression of human breast cancer [23]. ISH analyses 
of cervical intraepithelial neoplasia has provided 
suggestive evidence that chromosomes 1, 7 and X 
aneusomy is associated with progression toward cervi- 
cal carcinoma [24], 

Although the prognostic value of numeric aberrations 
remains a matter of debate in human hematopoietic 
neoplasia, there have been recent studies to suggest that 
the presence of monosomy 7 defines a distinct subgroup 
of acute myeloid leukemia patients [25]. It is interesting 
in this context that therapy-related myelodysplastic 
syndromes have been reported to display monosomy 5 
and 7 karyotypes, reflecting poor prognosis [26]. 

The clinical observations, mentioned previously, are 
supported by in vitro studies in human and rodent cells in 
which aneuploidy is induced at early stages of transforma- 
tion [27,28]. It is even suggested that aneuploidy may 
cause cell immortalization, in some instances, that is a 
critical step preceeding transformation. 

Finally, in an interesting study to develop transgenic 
mouse models of human chromosomal diseases, chromo- 
some segment specific duplication and deletions of the 
genome were reported to be constructed in mouse 
embryonic stem cells [29]. Three duplications for a 
portion of mouse chromosome 11 syntenic with human 
chromosome 17 were established in the mouse 
germline. Mice with 1Mb duplication developed corneal 
hyperplasia and thymic tumors. The findings represent 
the first transgenic mouse model of aneuploidy of a 
defined chromosome segment that documents the direct 
role of chromosome aneusomy in tumorigenesis. 
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Aneuploidy as "dynamic cancer-causing 
mutation" instead of a "consequential state" 
in cancer 

According to the hypothesis previously discussed, aneu- 
ploidy represents either a "gain of function" or "loss of 
function" mutation at the chromosome level with a 
causative influence on the tumorigenesis process. The 
hypothesis, however, is based only on circumstantial 
evidence even though existence of aneuploidy is corre- 
lated with different tumor phenotypes. The existence of 
numeric chromosomal alterations in a tumor does not 
mean that the change arose as a dynamic mutation due 
to genomic instability, because several factors could lead 
to consequential aneuploidy in tumors, also. Although 
aneuploidy as a dynamic mutation due to genomic insta- 
bility in tumor cells would occur at a certain measurable 
rate per cell generation, a consequential state of aneu- 
ploidy in tumors may not occur at a predictable rate 
under similar conditions or in tumors with similar 
phenotypes. In addition to genomic instability, differ- 
ences in environmental factors with selective pressure, 
could explain high incidence of aneuploidy and other 
somatic mutations in tumors compared with normal cells 
[4]. These include humoral, cell substratum, and cell- 
cell interaction differences between tumor and normal 
cell environments. It could be argued that despite 
similar rates of spontaneous aneuploidy induction in 
normal and tumor cells, the latter are selected to prolif- 
erate due to altered selective pressure in the tumor cell 
environment, whereas the normal cells are eliminated 
through activation of apoptosis. Alternatively, of course, 
one could postulate that selective expression or overex- 
pression of anti-apoptotic proteins or inactivation of 
proapoptotic proteins in tumor cells may counteract 
default induction of apoptosis in G2/M phase cells 
undergoing missegregation of chromosomes. Recent 
demonstration of overexpression of a G2/M phase anti- 
apoptotic protein survivin in cancer cells [30] suggests 
that this protein may favor aberrant progression of aneu- 
ploid transformed cells through mitosis. This would 
then lead to proliferation of aneuploid cell lineages, 
which may undergo clonal evolution. 

To ascertain that aneuploidy is a dynamic mutational 
event, various human tumor cell lines and transformed 
rodent cell lines have been analyzed for the rate of 
aneuploidy induction. When grown under controlled in 
vitro conditions, such conditions ensure that environ- 
mental factors do not influence selective proliferation of 
cells with chromosome instability. In one study, 
Lengauer et aL [31 •] provided unequivocal evidence by 
FISH analyses that losses or gains of multiple chromo- 
somes occurred in excess of 10" 2 per chromosome per 
generation in aneuploid colorectal cancer cell lines. The 
study further concluded that such chromosomal instabil- 
ity appeared to be a dominant trait. Using another in 



vitro model system of Chinese hamster embryo (CHE) 
cells, Duesberg et ai [32 # ] have also obtained similar 
results. With clonal cultures of CHE cells, transformed 
with nongenotoxic chemicals and a mitotic inhibitor, 
these authors demonstrated that the overwhelming 
majority of the transformed colonies contained more 
than 50% aneuploid cells, indicating that aneuploidy 
would have originated from the same cells that under- 
went transformation. All the transformed colonies tested 
were tumorigenic. It was further documented that the 
ploidy factor representing the quotient of the modal 
chromosome number divided by the normal diploid 
number, in each clone, correlated directly with the 
degree of chromosomal instability. Therefore, chromo- 
somal instability was found proportional to the degree of 
aneuploidy in the transformed cells and the authors 
hypothesized that aneuploidy is a unique mechanism of 
simultaneously altering and destabilizing, in a massive 
manner, the normal cellular phenotypes. In the absence 
of any evidence that the transforming chemicals used in 
the study did not induce other somatic mutations, it is 
difficult to rule out the contribution of such mutations 
in the transformation process. These results nonetheless 
make a strong case for aneuploidy being a dynamic chro- 
mosome mutation event intimately associated with 
cancer. 

Aneuploidy versus somatic gene mutation in 
cancer 

The idea that numeric chromosome imbalance or aneu- 
ploidy is a direct cause of cancer was proposed at the 
turn of the century by Theodore Boveri [33]. However, 
the hypothesis was largely ignored over the last several 
decades in favor of the somatic gene mutation hypothe- 
sis, mentioned earlier. Evidence accumulating in the 
literature lately on specific chromosome aneusomies 
recognized in primary tumors, incidence of aneuploidy 
in cells undergoing transformation, and aneuploid tumor 
cells showing a high rate of chromosome instability have 
led to the rejuvenation of Boveri's hypothesis. The 
concept has recently been discussed as a "vintage wine 
in a new bottle" [34*]. The author points out that 
except for rare cancers caused by dominant retroviral 
oncogenes, diploidy does not seem to occur in solid 
tumors, whereas aneuploidy is a rule rather than excep- 
tion in cancer. 

Aneuploidy as an effective mutagenic mechanism 
driving tumor progression, on the other hand, is being 
recognized as a viable solution to the paradox that with 
known mutation rate in non-germline cells (~10~ 7 per 
gene per cell generation) tumor cell lineages cannot 
accumulate enough mutant genes during a human life- 
time [35]. The concept is gaining significant credibility 
since genes that potentially affect chromosome segrega- 
tion were found mutated in human cancer. Some of 
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these genes have also been shown to have transforming 
capability in in vitro assays. Selected recent publications 
describing the findings are being discussed below in 
reference to the mitotic targets potentially involved in 
inducing chromosome segregation anomalies in cells. 

Potential mitotic targets and molecular 
mechanisms of aneuploidy 

Because aneuploidy represents numeric imbalance in 
chromosomes, it is reasonable to expect that aneuploidy 
arises due to missegregation of chromosomes during cell 
division. There are many potential mitotic targets, 
which could cause unequal segregation of chromosomes 
(Fig. 1). Recent investigations have identified several 
genes involved in regulating these mitotic targets and 
mitotic checkpoint functions, which can be implicated 
in induction of aneuploidy in tumor cells. This discus- 
sion is restricted to those mitotic targets and checkpoint 
genes whose abnormal functioning has been observed in 
cancer or has been shown to cause tumorigenic transfor- 
mation of cells, in recent years. The role of telomeres is 
discussed elsewhere in this issue. For a more detailed 
description of the components of mitotic machinery and 
their possible involvement in causing chromosome 
segregation abnormalities in tumor cells, readers may 
refer to a recently published review [36«]. 

Among, the mitotic targets implicated in cancer, centro- 
some defects have been observed in a wide variety of 
malignant human tumors. Centrosomes play a central role 
in organizing the microtubule network in interphase cells 
and mitotic spindle during cell division. Multipolar 
mitotic spindles have been observed in human cancers in 
situ and abnormalities in the form of supernumerary 



Figure 1. Potential mitotic targets causing aneuploidy in 
oncogenesis 
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Diagram illustrates that defects in several processes involving chromosomal, 
spindle microtubule, and centrosomal targets, in addition to abnormal cytokine- 
sis, may cause unequal partitioning of chromosomes during mitosis, leading to 
aneuploidy. Recently obtained evidence in favor of some of these possibilities is 
discussed in the text. 



centrosomes, centrosomes of aberrant size and shape as 
well as aberrant phosphorylation of ccntrosome proteins 
have been reported in prostate, colon, brain, and breast 
tumors [37,38]. In view of the findings that abnormal 
centrosomes retain the ability to nucleate microtubules in 
vitro, it is conceivable that cells with abnormal centro- 
somes may missegregate chromosomes producing aneu- 
ploid cells. The molecular and genetic bases of abnormal 
centrosome generation and the precise pathway through 
which they regulate the chromosome segregation process 
remain to be elucidated. Recent discovery of a centro- 
some-associated kinase STK15/BTAK/aurora2, naturally 
amplified and overexpressed in human cancers, has raised 
the interesting possibility that aberrant expression of this 
kinase is critically involved in abnormal centrosome func- 
tion and unequal chromosome segregation in tumor cells 
[39,40]. Exogenous expression of the kinase in rodent and 
human cells was found to correlate with an abnormal 
number of centrosomes, unequal partitioning of chromo- 
somes during division, and tumorigenic transformation of 
cells. It is relevant in this context to mention that the 
Xenopus homologue of human STK15/BTAK/aurora2 
kinase has recently been shown to phosphorylate a micro- 
tubule motor protein XIEgS, the human orthologue of 
which is known to participate in the centrosome separa- 
tion during mitosis [41]. Findings on STK15/aurora2 
kinase, thus, provide an interesting lead to a possible 
molecular mechanism of centrosome's role in oncogene- 
sis. Centrosomes have, of late, been implicated in onco- 
genesis from studies revealing supernumerary centro- 
somes in />5J-deficient fibroblasts and overexpression of 
another centrosome kinase PLK1 being detected in 
human non-small cell lung cancer [42]. 

One of the critical events that ensures equal partition- 
ing of the chromosomes during mitosis is the proper 
and timely separation of sister chromatids that are 
attached to each other and to the mitotic spindle. 
Untimely separation of sister chromatids has been 
suspected as a cause of aneuploidy in human tumors. 
Cohesion between sister chromatids is established 
during replication of chromosomes and is retained until 
the next metaphase/anaphase transition. It has been 
shown that during metaphase-anaphase transition, the 
anaphase promoting complex/cyclosome triggers the 
degradation of a group of proteins called securins that 
inhibit sister chromatid separation. A vertebrate securin 
(v-securin) has recently been identified that inhibits 
sister chromatid separation and is involved in transfor- 
mation and tumorigenesis. Subsequent analysis 
revealed that the human securin is identical to the 
product of the gene called pituitary tumor transforming 
gene, which is overexpressed in some tumors and 
exhibits transforming activity in NIH3T3 cells. It is 
proposed that elevated expression of the v-securin may 
contribute to generation of malignant tumors due to 
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chromosome gain or loss produced by errors in chro- 
matid separation [43»]. 

Normal progression through mitosis during prophase to 
anaphase transition is monitored at least at two check- 
points. One checkpoint operates during early prophase 
at G2 to metaphase progression while the second 
ensures proper segregation of chromosomes during 
metaphase to anaphase transition. Several mitotic 
checkpoint genes responding to mitotic spindle defects 
have been identified in yeast. The metaphase-anaphase 
transition is delayed following activation of this check- 
point during which kinetochores remain unattached to 
the spindle. The signal is transmitted through a kineto- 
chore protein complex consisting of Mpslp and several 
Mad and Bub proteins [44]. It is expected that for 
unequal chromosome segregation to be perpetuated 
through cell proliferation cycles giving rise to aneu- 
ploidy, checkpoint controls have to be abrogated. 

Following this logic, Vogelstein era/. [45»] hypothesized 
that aneuploid tumors would reveal mutation in mitotic 
spindle checkpoint genes. Subsequent studies by these 
investigators have proven the validity of this hypothesis 
and a small fraction of human colorectal cancers have 
revealed the presence of mutations in either hBubl or 
hBubRl checkpoint genes. It was further revealed that 
mutant BUB1 could function in a dominant negative 
manner conferring an abnormal spindle checkpoint 
when expressed exogenously. Inactivation of spindle 
checkpoint function in virally induced leukemia has also 
recently been documented following the finding that 
hMADl checkpoint protein is targeted by the Tax 
protein of the human T-cell leukemia virus type 1. 
Abrogation of hMADl function leads to multinucleation 
and aneuploidy [46], 

In addition to mitotic spindle checkpoint defects, failed 
DNA damage checkpoint function in yeast is frequently 
associated with aberrant chromosome segregation as 
well. It, therefore, appears intriguing yet relevant that 
the h umah BRCA1 gene, proposed to be involved in 
DNA damage checkpoint function, when mutated by a 
targeted deletion of exon 11 led to defective G2/M cell 
cycle checkpoint function and genetic instability in 
mouse embryonic fibroblasts [47]. The cells revealed 
multiple functional centrosomes and unequal chromo- 
some segregation and aneuploidy. Although the molecu- 
lar basis for these abnormalities is not known at this 
time, it raises the interesting possibility that such an 
aneuploidy-driven mechanism may be involved in 
tumorigenesis in individuals carrying germline muta- 
tions of BRCA1 gene. 



Conclusion 

Growing evidence from human tumor cytogenetic inves- 
tigations strongly suggest that aneuploidy is associated 
with the development of tumor phenotypes. Clinical 
findings of correlation between aneuploidy and tumori- 
genesis are supported by studies with in vitro grown 
transformed cell lines. Molecular genetic analyses of 
tumor cells provide credible evidence that mutations in 
genes controlling chromosome segregation during 
mitosis play a critical role in causing chromosome insta- 
bility leading to aneuploidy in cancer. Further elucida- 
tion of molecular and physiologic bases of chromosome 
instability and aneuploidy induction could lead to the 
development of new therapeutic approaches for 
common forms of cancer. 
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Abstract: Epithelial tumors develop through a multistep process driven by 
genomic instability frequently associated with etiologic agents such as pro- 
longed tobacco smoke exposure or human papilloma virus (HPV) infection. 
The purpose of the studies reported here was to examine the nature of genomic 
instability in epithelial tissues at cancer risk in order to identify tissue genetic 
biomarkers that might be used to assess an Individual's cancer risk and 
response to ch era opreven live intervention. As part of several chemoprevention 
trials, biopsies were obtained from risk tissues (i.e., bronchial biopsies from 
chronic smokers, oral or laryngeal biopsies from individuals with premalig- 
nancy) and examined for chromosome instability using in situ hybridization. 
Nearly all biopsy specimens show evidence for chromosome instability 
throughout the exposed tissue. Increased chromosome instability was observed 
with histologic progression in the normal to tumor transition of head and neck 
squamous cell carcinomas. Chromosome instability was also seen in premalig- 
nant head and neck lesions, and high levels were associated with subsequent 
tumor development. In bronchial biopsies of current smokers, the level of 
ongoing chromosome instability correlated with smoking intensity (e.g., 
packs/day), whereas the chromosome index (average number of chromosome 
copies per cell) correlated with cumulative tobacco exposure (i.e., pack-years). 
Spatial chromosome analyses of the epithelium demonstrated multifocal clonal 
outgrowths. In former smokers, random chromosome instability was reduced; 
however, clonal populations appeared to persist for many years, perhaps 
accounting for continued lung cancer risk following smoking cessation. 

Keywords: chromosome instability; epithelial cells; aerodigestive tract; 
chemoprevention; cancer risk 



THE NEED FOR BIOMARKERS OF CANCER RISK AND 
RESPONSE TO INTERVENTION 

Epithelial cancers remain a major health challenge in the world. Despite improve- 
ments in staging and the application and integration of surgery, radiotherapy, and 
chemotherapy, the 5-year survival rate for individuals with lung cancer is only about 
\5%. 1 Even if strategies for early detection are successful and lung cancers 
are detected at a stage where local tumor resection and treatment is curative, 
these patients will still be at significant risk for developing second primary tumors 
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associated with the problem of field cancerization. 2 Similarly, for individuals with a 
first head and neck primary tumor, even if the first malignancy is successfully treat- 
ed, the risk of developing a second primary in the tobacco smoke-exposed field is 
approximately 40%. 3 Similar cancer risk estimates exist for individuals who exhibit 
severe dysplasia in premalignant epithelial lesions. 4 For these reasons, it is important 
to focus on chemopreventive strategies to prevent the development of epithelial 
malignancies. 

Several problems confront chemoprevention trials designed to identify effica- 
cious agents. 5 First, chemoprevention trials with cancer incidence as a primary end- 
point require tens of thousands of subjects and tens of years of intervention and 
follow-up for statistical evaluation. For example, a recendy reported trial involved 
30,000 subjects and required 10 years in order to examine the impact of prevention 
strategies on lung cancer development, only to find a possible increased lung cancer 
incidence in current smokers who received P-carotene. 6 

The problem of large, long-term trials results from the difficulty in identifying 
individuals at highest cancer risk who might best benefit from chemopreventive 
intervention. For example, 20 pack-year smokers, while known to be at relatively 
increased risk for developing lung cancer, have approximately a 10% lifetime risk 
for developing lung cancer. This seriously limits the number of potentially useful 
strategies that can be clinically explored. A second problem facing chemoprevention 
trials is that little is known about what agents are likely to have efficacy, and even 
less is known regarding proper doses, schedules, and durations of treatment. Part of 
the reason for this problem is that too little is known about the physiologic processes 
that drive epithelial cancer development. 

In order to reduce the number of subjects and the time required to carry out 
chemoprevention trials and thus allow the exploration of multiple prevention strate- 
gies, two types of advances are necessary. First, it is important to identify individuals 
at significantly increased cancer risk who might best benefit from different types of 
intervention. Second, in order to allow the rapid identification of agents, doses, and 
schedules of potentially efficacious agents, it is necessary to identify and validate 
surrogate endpoints of response that indicate whether the agents are having a posi- 
tive impact on the target tissue during the chemopreventive intervention. 

One approach to identifying individuals at increased aerodigestive tract cancer 
risk is to explore epidemiologic features of potential subjects. Molecular epidemio- 
logic studies are beginning to identify intrinsic host factors that place some individ- 
uals at increased cancer risk, especially those with a chronic smoking history. 8 Most 
intrinsic factors identified thus far reflect levels of carcinogen metabolism, repair 
capabilities of the host following DNA damage, and other measures of intrinsic 
cellular sensitivity to mutagens. While these factors can provide statistically signif- 
icant risk ratios in case-control studies that are controlled for tobacco exposure, the 
detected risk ratios usually fall in the range of 1.5 to 10. Unfortunately, this is not 
sufficient for the individualization of treatment and is not sufficiently high to signif- 
icantly reduce the numbers of subjects required for chemoprevention trials with 
cancer incidence as the primary endpoint. 

Another approach to identifying individuals at increased cancer risk is to directly 
examine the target tissue of individuals with known carcinogen exposure (e.g., 
chronic tobacco smoke exposure), who have evidence of target organ dysfunction 
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(e.g., chronic obstructive pulmonary disease, changes in voice quality), or who 
have clinical evidence of premalignancy (e.g., bronchial metaplasia/dysplasia, oral 
leukoplakia/erythroplakia, cervical intraepithelial neoplasia). The conventional 
standard for assessing cancer risk in these situations is the degree of histological 
change. However, while individuals who show moderate to severe dysplasia are 
known to be at increased cancer risk when compared to individuals with lesser his- 
tologic changes, it is often difficult to distinguish reactive changes to carcinogenic 
insult from initiated and progressing lesions. Similarly, upon cessation of carcino- 
genic insult, histologic changes may reverse yet cancer risk may continue for many 
years. For example, while smoking cessation is associated with decreased bronchial 
metaplasia, 9 increased lung cancer risk continues for many years beyond smoking 
cessation. 10 In fact, nearly half the newly diagnosed lung cancer cases in the USA 
occur in former smokers. 1 1 

The development of assays to identify individuals at high epithelial cancer risk 
and to directly assess response to intervention in the target tissue is therefore an 
important research goal. Such assays should be objective and easily quantifiable and, 
if possible, minimally invasive. Moreover, they should reflect both the disease pro- 
cess and the targeted pathway and thereby be useful in assessing risk and monitoring 
response to intervention as well as directly testing the hypothesized mechanism of 
action of the chemopreventive strategy. 

In the chemoprevention setting it is important to recognize that one does not 
know the location of the future cancer. Thus, assays must necessarily be carried out 
on random biopsies of the field at risk. Even if there. are clinically evident premalig- 
nant lesions, this does not mean that this is the likely site for a future malignancy. 
For example, nearly half of the cancers that develop in individuals with oral leuko- 
plakia arise away from the original index lesion. Similarly, since many newly diag- 
nosed lung cancers arise in the peripheral parts of the lung (e.g., adenocarcinomas), 
especially in former smokers, and since endobronchoscopy predominantly accesses 
central components of the lung, it is important to identify biomarkers that can reflect 
global processes ongoing in the target epithelial field associated with increased can- 
cer risk. Their discovery requires a better understanding of the tumorigenesis pro- 
cess in epithelial fields at cancer risk. 



THE RATIONALE FOR STUDYING 
GENOMIC INSTABILITY AS A MARKER OF RISK 

Tumors of the aerodigestive tract have been proposed to reflect a "field canceriza- 
tion*' process whereby the whole tissue is exposed to carcinogenic insult (e.g., tob- 
acco smoke) and is at increased risk for multistep tumor development. 12,13 Several 
types of clinical and laboratory data support this notion, including the frequent 
occurrence of synchronous primary and subsequent second primary tumors in the 
aerodigestive tract (frequently exhibiting dissimilar histologies as well as distinct 
genetic signatures 14 " 16 ) and the presence of premalignant lesions that precede and/or 
accompany the tumor in the exposed tissue field. 17 The notion of a multistep tumor- 
igenesis process* is further supported by serial clinical and histologic evaluations of 
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target tissue or exfoliated cells where increasing degrees of histological abnormali- 
ties are observed over time. 18 

A working model for aerodigestive tract tumorigenesis is illustrated in Figure 1. 
Tumori genesis in the face of carcinogenic exposure likely involves a chronic process 
of tissue injury and wound healing. DNA damage induced by the carcinogen is likely 
fixed into permanent genetic changes (e.g., chromosome damage, chromosome non- 
disjunction, gene mutation, gene deletion, etc.) during the process of proliferation. 
This damage would be expected to be distributed throughout the exposed tissue field 
leading to a background of generalized genomic damage (depicted in Figure 1 as a 
background mat of increasing density). Chronic injury and repair likely leads to the 
accumulation of cells with increasing amounts of genetic changes as well as the out- 
growth of abnormal clones (triangles in Figure 1) carrying an accumulation of 
genetic changes important for selective survival, dysregulated growth, and preferen- 
tial epithelial take-over by initiated clones (see Figure 2). 

Cellular and molecular evidence for the field carcinogenesis and multistep tum- 
origenesis model comes from many laboratories. 19,20 With the advent of a wide array 
of molecular technologies, a large number of specific molecular genetic and epige- 
netic changes involving specific oncogenes, tumor suppressor genes, cell regulatory 
genes, and repair genes have now been described for aerodigestive tract cancers. The 
identification of these specific molecular changes have now provided probes to 
explore specific events occurring in premalignant lesions adjacent to aerodigestive 
tract tumors. 21 " 24 Frequendy, these premalignant lesions snowed a subset of the 
same molecular changes found in the associated tumor, suggesting that these lesions 
might represent precursor lesions for the associated tumors (i.e., a manifestation of 
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FIGURE 1. Field cancerization and multistep tumorigenesis. 
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FIGURE 2. Multiple focal clonal evolution during multistcp tumorigenesis. 



a multistep tumorigenesis process). For example, studies of the premalignant lesions 
adjacent to head and neck tumors have provided evidence for a gradual accumulation 
of genetic alterations accompanied by evidence for dysregulation of cellular control 
mechanisms (e.g., alterations in expression of PCNA, EGFR, TGF-p, p53, and 
cycIinDl). 25 - 28 

These types of studies have now also been applied to the target epithelium of indi- 
viduals at increased risk for aerodigestive tract cancer (i.e., individuals with a chron- 
ic smoking/alcohol history and/or prior aerodigestive tract cancer). Several groups 
(using polymerase chain reaction, PCR, analysis of microdissected epithelium) have 
now demonstrated the presence of clonal outgrowths in the target premalignant epi- 
theli um of individuals at increased risk for cancer. 2 For example, examination of 
bronchial biopsies derived from individuals with a 20 pack-year smoking history 
demonstrated that 76% of the cases showed evidence for LOH (3pl4, 9p21, or 
17pl3) in at least one of six lung biopsy sites. On a per site basis, some form of LOH 
was observed in 25% of the sites examined. 29 

If aerodigestive tract cancer development reflects a field cancerization process 
involving multistep events, then risk and response information should be able to be 
derived from random biopsies or exfoliated cells from the field at risk or from assess- 
ments of tissue undergoing similar processes. Hypothetically, lesions exhibiting the 
greatest degree of genomic instability, clonal outgrowth, and abnormal epithelial 
regulation would be at the highest relative aerodigestive tract cancer risk. Similarly, 
an active chemopreventive intervention might be expected to decrease these mani- 
festations of risk. Reduced risk manifestations include decreased levels of ongoing 
genetic instability, decreased frequency of clonal outgrowths, and increased epithe- 
lial growth regulation. 
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THE MEASUREMENT OF CHROMOSOME INSTABILITY USING 
CHROMOSOME IN SITU HYBRIDIZATION 

Molecular genetic techniques, while extremely useful for detecting clonal chang- 
es in targets tissues, are somewhat limited in their ability to detect random genetic 
instability. Conventional cytogenetic assays are useful for detecting chromosome 
instability and clonal chromosome changes. However, they require numbers of 
dividing cells for karyotypic analysis that are difficult to attain in the setting of biop- 
sies acquired during the course of a chemoprevention trial. A technique was there- 
fore needed that would allow chromosome instability measurements in situations 
where few cells are available (e.g. small biopsies, brushings, or sputum samples) and 
where the target material might be fixed. It was also desirable to have a technique 
that would be adaptable to tissue sections, whereby spatial information could be 
retained and genotype/phenotype associations could be determined on the same or 
adjacent tissue sections. The technique of in situ hybridization (ISH) involves the 
use of DNA probes that recognize either chromosome-specific repetitive target 
sequences, chromosome single gene copy sequences, or sequences along the whole 
chromosome length or chromosome segments. 32 We have adapted the ISH technique 
for formalin-fixed, paraffin-embedded tissue sections and have applied it to a variety 
of tissues, including the aerodigestive tract. 33,34 

Using probes that label the centromere regions of specific chromosomes, this 
assay permits determination of the average chromosome number per cell for each 
specimen. This assay is also useful for detecting generalized chromosome instability 
during the tumori genesis process. Normal diploid populations should have two cop- 
ies of each autosomal chromosome and should rarely show three or more chromo- 
some copies per cell (chromosome polysomy), especially in tissue sections where 
nuclear truncation results in an under-representation of chromosome copy number. 
Thus, the detection of cells with three or more chromosome copies would indicate 
the presence of chromosome instability. 

To examine this technique's potential for characterizing the multistep tumorigen- 
esis process in the aerodigestive tract, we measured the fraction of cells exhibiting 
three or more chromosome copies in apparently contiguous epithelial transitions 
from normal to hyperplastic to dysplastic to carcinomas, all on a single tissue slice 
of head and neck squamous cell carcinomas. 34 In these specimens, greater than 35% 
of the cases of adjacent "normal" epithelium, greater than 65% of the cases of hyper- 
plastic epithelium, and greater than 95% of the dysplastic and tumor regions showed 
evidence of chromosome polysomy. Of interest, similar transitions of chromosome 
instability were observed with at least four different chromosome probes. Similar 
trends have also been observed in amenable tissue from other epithelial malignan- 
cies, including cervix, bladder, and breast. 35 These results thus suggested that the 
notions of field cancerization and multistep tumorigenesis might apply to several 
epithelial tissues and that measures of chromosome instability might be useful for 
monitoring this process. 

In the situations described above, the premalignant lesions examined might be 
considered to represent epithelium at 100% risk of being in a cancer field, since they 
were located in the adjacent epithelium to the cancer. This then raises the question 
of the nature of genetic instability in the epithelium of individuals at increased risk 
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for developing cancer. To explore this issue, we obtained biopsies during the course 
of leukoplakia chemoprevention trials exploring the use of 13-cis-retinoic acid in 
reversing leukoplakia and probed them for genetic instability using in situ hybridiza- 
tion. In one retrospective study and in one prospective study of subjects with oral 
leukoplakia, the results indicate that those subjects whose pretreatment biopsies har- 
bor relatively high levels of genomic instability (i.e., more than 3% of the cells 
examined showing at least 3 chromosome 9 copies per cell) have a significandy 
higher likelihoo4 of suffering early onset of head and neck cancer. 36,37 Interestingly, 
half of the tumors that did develop occurred away from the biopsy site used to mea- 
sure genetic instability. This result suggests that genomic instability measurements 
in carcinogen-exposed tissue can provide useful cancer risk estimates. 



THE RELATIONSHIP BETWEEN TOBACCO EXPOSURE AND 
CHROMOSOME INSTABILITY 

In recent years, the aerodigestive tract chemoprevention group at M.D. Anderson 
Cancer Center has initiated three sequential biomarker- associated chemoprevention 
trials involving chronic smokers with a greater than 20 pack-year smoking history. 
In each of these studies, endobronchial biopsies were obtained from six defined sites 
within the lung, including the carina and at bifurcation points at the upper, middle, 
and lower right lung and at the upper and lower left lung. Biopsies were obtained pri- 
or to and following chemopreventive intervention and were subjected to in situ 
hybridization analysis in addition to analyses for other biomarkers. The first impor- 
tant finding was that some degree of chromosome polysomy was evident in all lung- 
sites examined, and this was observed independently of the particular chromosome 
probe utilized. 38 This finding supports the notion that random chromosome changes 
may be occurring throughout the exposed lung field. 

In a second study, bronchial biopsies were obtained from individuals with a 20 
pack-year smoking history. In this study, most of the subjects involved were current 
smokers. 39 Interestingly, all cases who showed metaplasia at one of six biopsy sites 
also showed chromosome polysomy in at least one biopsy site; overall, 88% of the 
sites showed some evidence of chromosome 9 polysomy. 40 Evidence for genetic 
instability was also detected in patients who did not show evidence of bronchial 
metaplasia in any of six biopsy sites despite a strong smoking history. In fact, more 
than 90% of the cases and more than 60% of the sites showed significant chromo- 
some polysomy (i.e., at least three copies in at least 2 % of the cells examined). 
These results suggest that the lungs of long-term smokers show significant evidence 
of genetic instability, and this instability can be detected throughout the accessible 
bronchial tree, even when bronchial metaplasia is not evident. 

These studies in current smokers has allowed us to examine the relationship 
between the levels of genetic instability detected and subject characteristics such as 
smoking status (current or former), smoking history, and lung tissue pathologic 
changes. Evaluable biopsy material has now been obtained from more than 108 cur- 
rent smokers, including more than 480 evaluable biopsy sites. The mean metaplasia 
index in these current smokers was 30.4%. For the total population studied, the 
median chromosome index for the bronchial biopsies was 1.41 (range, 1.04-1.61) 
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and the median chromosome polysomy index was 2.0% (range 0-8.7%). This can be 
compared to a mean chromosome index between 1.2-1.4 for lymphocytes and very 
rare chromosome polysomy. Interestingly, the intrasubject variability in chromo- 
some instability was relatively low in most subjects and was less than the intersub- 
ject variability. These results suggested that chronic smokers harbor detectable 
chromosome instability throughout the accessible bronchial tree (supporting the 
field carcinogenesis notion) and that information from one biopsy site might yield 
representative information for the rest of the lung field. 

Since most of the current smokers exhibited bronchial metaplasia in at least one 
of the biopsied sites, this allowed us to examine the relationship between chromo- 
some instability and histologic changes, both on a site-by-site basis and on a per case 
basis. On a site-by-site basis, the chromosome indices of lesions showing squamous 
metaplasia were similar to those not showing metaplasia (i.e., median 1,43 vs. 1.43), 
and the degree of chromosome polysomy in metaplastic lesions were only slightly 
higher than in non-metaplastic sites (medians: 2.2% vs. 1.8%, respectively). Thus, 
the presence or absence of squamous metaplasia at a biopsy site does not necessarily 
correlate with the degree of underlying genomic instability. On the other hand, those 
subjects with metaplasia indices of at least 15% also showed higher levels of chro- 
mosome polysomy than did subjects with metaplasia index below 15% (medians: 
2.4% vs. 1 .8%, p = 0.005). Thus, these chromosome instability assessments in cur- 
rent smokers appeared to reflect a more global process in the lung field. 

Tobacco exposure has been shown to significantly increase the risk of developing 
lung cancer, and the degree of risk is related to the extent of tobacco exposure. We 
were interested in determining the relationship between individuals* smoking histo- 
ry parameters and the levels of chromosome change found in their lungs following 
years of tobacco exposure. While there was significant intersubject variation for sim- 
ilar tobacco exposure histories, overall there was a significant correlation between 
the degree of chromosome polysomy and the intensity of ongoing tobacco exposure 
(packs/day, p = 0.02 on a per site basis) and with the extent of tobacco exposure 
(pack-years, p = 0.003). Thus the amount of chromosome polysomy reflects the 
intensity and extent of tobacco exposure. At the same time, individuals with similar 
smoking histories showed widely divergent amounts of chromosome polysomy, pos- 
sibly reflecting differences in intrinsic sensitivity between subjects. There was also 
strong correlation between the chromosome index and the duration of the smoking 
history (smoking years) and total accumulated exposure (pack-years, p = 0.0001). 
These results suggest that tobacco exposure is associated with the initiation and 
accumulation of chromosome instability in the exposed lung; however individuals 
are differentially sensitive to carcinogenic insult. The working hypothesis is that 
those individuals who accumulate the highest degree of chromosome changes will 
be at the highest lung cancer risk. 

Many of the bronchial biopsies from chronic smokers examined by in situ hybrid- 
ization showed a rise in the chromosome index above that expected for a diploid cell 
population, especially in subjects with an extensive smoking history. The rise in 
chromosome index was also accompanied by an increase in the fraction of cells 
exhibiting at least 3 chromosome copies per cell. To determine if a rise in the tissue 
chromosome index was due to clonal expansion of populations with chromosome tri- 
somy, the chromosome copy number and relative coordinates of each cell scored in 



HITTELMAN: GENETIC INSTABILITY IN TISSUES AT RISK FOR CANCER 



9 



the bronchial epithelium was recorded and a spatial genetic map was created. 41 We 
then developed algorithms for calculating localized chromosome indices within the 
tissue. Since trisomic clones would have, on average, three chromosomes instead of 
two, those cells involved in neighborhoods with chromosome indices three-halves 
that of diploid populations could be marked as being part of a trisomic clone. Simi- 
larly, groups of cells with chromosome indices half that of diploid populations could 
be marked as being part of a monosomic clone. This allowed the generation of a sec- 
ond-order, two-dimensional genetic map representation of the bronchial epithelium 
showing the relative locations of cells involved in monosomic and trisomic clonal 
outgrowths. When adjacent tissue sections from the same -bronchial biopsy were 
probed separately for different chromosomes, the detected clones appeared to occu- 
py separate subregions of the epithelium. This result suggests that not only are the 
lungs of chronic smokers undergoing a process of genetic instability, they are expe- 
riencing the outgrowth of multiple clones throughout the exposed lung field, as pos- 
tulated by the models shown in Figures 1 and 2. One advantage of this clonal 
approach is that the contribution of both monosomic and multisomic clones can be 
detected. 

Since smoking cessation has been suggested to reduce the lung cancer risk, it was 
of interest to determine whether the levels of chromosome instability would decrease 
following smoking cessation. This question was possible to examine because our 
third sequential chemoprevention trial involved subjects who had discontinued 
smoking. So far, more than 220 subjects (more than 650 biopsies) who have quit 
smoking (mean 9.9 quit-years) have been evaluated for chromosome instability in 
their lungs. Despite the fact that the mean metaplasia index in this group is 5.8% 
(considerably less than that in current smokers), chromosome instability is still 
observed in the majority of subjects. 42 While the mean chromosome polysomy level 
is reduced to 1.0%, some individuals continue to show polysomy levels above 5%. 
Interestingly, while the overall chromosome polysomy levels were reduced in these 
individuals who stopped smoking, the mean chromosome index remained at about 
1 .4 with some individuals exhibiting chromosome indices as high as 1 .8. Initial chro- 
mosome mapping studies suggest that while random chromosome instability seems 
to decrease following smoking cessation, the clonal outgrowths may remain for 
many years in the lung. The working hypothesis is that those individuals who show 
the greatest degree of remaining chromosome instability are at the highest lung can- 
cer risk despite smoking cessation. Long-term follow-up on these subjects will be 
necessary to test this hypothesis. 



SUMMARY AND CONCLUSIONS 

Aerodigestive tract tumorigenesis appears to be a multistep process taking place 
throughout the tissue fields of exposure. When viewed in the context of chromosome 
changes, carcinogen exposure appears to be associated with the random acquisition 
of chromosome polysomy throughout the exposed field, the degree of which is relat- 
ed to the degree and extent of carcinogen exposure as well as to the instrinsic suscep- 
tibility of the exposed individual. Continued exposure leads to continued acquisition 
of new changes and, in association with chronic wound-healing processes, to the 
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accumulation of clonal outgrowths throughout the target tissue. Although the ulti- 
mate malignancy may occur in only one or few tissue sites, manifestations of the 
instability process that drives tumorigenesis is globally present in the tissue. Thus 
random biopsies may provide useful risk information for the exposed field as a 
whole. Even when carcinogen exposure is reduced or chemopreventive strategies are 
initiated and histologic manifestations of the tumorigenesis process subside, the 
genetic scars of prior exposure remain in the form of clonal outgrowths and may 
explain continued lung cancer risk in ex-smokers. Future chemoprevention strategies 
need to focus on reducing the degree of chromosome instability and on trying to 
eliminate residual abnormal clonal outgrowths in the aerodigestive tract. In this set- 
ting, the measurement of chromosome instability in the target tissue will be useful in 
assessing cancer risk as well as response to intervention. 
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CORRESPONDENCE 



is that risk relates to the environment and 
human health. On die other hand, recent 
studies have repeatedly shown that public 
hesitance also includes a number of ethical 
issues (e.&, market dominance of a few large 
companies and GM crops threatening 
natural or divine orders, refs 
1 ,2). Our worry is that the 
US government is 
neglecting widespread 
concerns of the European 
public that include more 
than environmental risk 
and human health. 

Research carried out by 
our group in Denmark 1 
indicates that, although 
many people are confident 
that the public authorities 
are able to manage the risks 
here and now, people are less confident about 
their ability to handle long-term effects 
because of the scientific uncertainty. 
Attempts to conceal these or other limits to 
scientific knowledge do not prevent 
controversies from arising; rather, the 
opposite happens because trust in business, 
scientific experts and public authorities is 
undermined (witness the handling of the 
BSE controversy in the United Kingdom). 

hi the long run, a policy of openness about 
the different dimensions of uncertainty 
would be more likely to increase trust in 
scientific risk assessment Of course, this 
will not guarantee public acceptance of GM 
food, but experience in Europe shows that 
transparency and dialog are prerequisites for 
decreasing concerns about new technology. 




The argument that the EU's resistance 
to GM food has had negative consequences 
for developing countries, denying them 
access to a technology that could alleviate 
food provision, is regarded sympathetically 
by many among the European public. 

Indeed, here most people 
abandon the simple 
dichotomy between 
'unacceptable* GM food and 
the much more acceptable 
medical uses. This is because 
GM foods 

are seen as a means to help 
people in distress. Many 
counter such humanitarian 
uses, however, by the 
observation that, in general, 
GM crops are developed not 
to benefit people in the 
developing world, but to make money. 
Needless to say, according to those who 
point this out, making money is not in 
itself an acceptable objective. Thus, the fear 
is that the benefits will never accrue to , 
those who are at present suffering. 

Krtstian Borch, JesperlAssen 
& Rikke H Jvrgensen 

Centre for Hioethics and Risk Assessment, 
Systems Analysis Department, 
PO Box 49, DK-4000 Roskilde, Denmark 
e- math krtstian. borch @risoe. dk 
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Mining the literature and large 
datasets 



(or perhaps was exacerbated by) a UK 
government seen to be welcoming of GM 
foods and crops. Another negative was that 
it was major transnational corporations — 
another questionable community in the 
eyes of much of the public here — that 
Avere seeking to push their new products 
onto the public without previous debate 
and without there being any perceptible 
benefit And finally, the potentially negative 
impact of GM crops on organic farmers — 
who are seen by some as crucially 
important for the sustainable future of 
food production — and the relatively small 
scale of agricultural production in the 
United Kingdom {and Europe) have also 
been important issues. 

The question to be answered, therefore, 
is not how to force the EU to accept GM 
foods and crops against its own public 
opinion, but how to change public opinion 
in the EU. The UK government is currently 
conducting several exercises that it hopes 
will provide the facts to support a relaxation 
of the moratorium on growing GM crops. 
These include a major review of the costs 
and benefit'; of GM crops (just finished), 
a scientific review of the issues (also now 
finished), a series of crop trials (results in 
September) and a public debate on GM 
crops, 'GM nation' (just finished). 

Whether these will change attitudes is 
moot: the costs»and-benefUs review has 
concluded that the economic value of the 
few currently available GM crops that could 
be grown in the UK is likely to be limited 
because of negative consumer attitudes to 
GM foods. 

MH Barnes 

J The Spinney, 

Watford WD174QF, UK 

e-mail: MBarnes466@aoLcom 

To the editor: 

Several articles in the July and August issues 
of Nature Biotechnology (21, 735-738, 2003; 
21, 852-854, 2003) discuss whether the US 
strategy of forcing the European Union 
(EU: Brussels, Belgium) to accept GM foods 
by referring to World Trade Organisation 
(WTO; Geneva* Switzerland) rules will bear 
fruit We do not believe so — rather the 
opposite. 

A central claim in the arguments of 
both President Bush and US commerce 
representative Robert B. Zoellick is that the 
risk of GM foods is negligible. The veracity 
of that statement, however, depends on what 
is defined as risk. A common understanding 



To the editor: 

In the accelerating quest for disease 
biomarkers, the use of high- throughput 
technologies, such as DNA microarrays 
and proteomics experiments, has produced 
vast datasets identifying thousands of 
genes whose expression patterns differ in 
diseased versus normal samples. Although 
many of these differences may reach 
statistical significance, they are not always 
biologically meaningful. For example, 
reports of mRNA or protein changes of 
as little as two- fold are not uncommon, 
and although some changes of this 
magnitude turn out to be important, most 



are attributable to disease-independent 
differences between the samples. Evidence 
gleaned from other studies linking genes to 
the disease is helpful, but with such large 
datasets, a manual literature review is often 
not practical. Thus, the power of these 
emerging technologies — the ability to 
quickly generate large sets of data — has 
challenged current means of evaluating 
and validating these data. One study from 
1999, for example, reveals that a researcher 
would have to scan 130 different journals 
and read 27 papers per day to follow a 
single disease, such as breast cancer 1 . 
To address this need, my group at 
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Harvard recently developed a freely 
accessible automated literature- 
mining tool, termed MedGene, that 
comprehensively summarizes the 
relationships among over 50,000 named 
human genes (and their synonyms) and 
over 4,000 human diseases from over 
12 million records in Medline 
(bttp://hipscq.med.harvardLedu/MedGene). 
Several key features of this resource are 
worth noting. First, MedGene is not 
limited to any specific relationship type, 
but rather encompasses all reported 
gene -disease links, including the 
genetic, biochemical, pharmacological, 
epidemiological and physiological. Second, 
the database assigns a mathematical score 
summarizing the strength of the 
association between the disease and the 
gene, which allows semiquantitative 
analysis and organizes the genes in rank 
order, finally, the relationships are 
identified automatically by advanced text 
searching and filtering algorithms that 
result in low 

rates of false -positive and false-negative 
linkages 2 . In one query, MedGene 
identified nearly 2,400 breast 



cancer-related genes, whereas the same 
search in four commonly used databases 
yielded a combined total of 286 genes, 
260 of which were included in the 
MedGene list 1 " 5 . 

A summary of all gene-disease 
relationships offers the unique opportunity 
to both evaluate and validate the outcome 
of high- throughput experiments. For 
example, we used MedGene to analyze a 
DNA microarray experiment in which over 
2,000 genes demonstrated statistically 
significant differences in expression 
between normal breast tissue and breast 
cancer. It was able to identify the subset of 
these genes previously described as breast 
cancer-related genes in the literature. To 
determine whether gene expression level 
correlated with the strength of the 
association between gene and breast 
cancer, we plotted gene expression levels 
against the breast cancer literature 
relationship scores assigned by MedGene. 
Interestingly, 

there is no correlation when considering 
expression differences as high as fivefold; 
however, a significant correlation is 
observed (r = 0.41; P = 0.05) among genes 



showing a difference of tenfold or more. 
Thus, for this experiment, expression level 
differences as high as fivefold cannot be . 
attributed to the disease without 
corroborating evidence. It will be 
interesting to learn if similar results hold 
for other diseases and other experiments. 

As the search for disease biomarkers and 
drug targets comes to rely increasingly 
upon genomic-scale technologies, demand 
will grow for automated resources, such as 
MedGene, that help process the resulting 
data volume. 

Joshua ixiBaer 

Institute of Proteomics, 
Harvard Medical School, 
250 Longwood Ave., BCMP, 
Boston, Massachusetts 021 15, USA 
c~ math josh @hms.harvard.cdu 
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An increased high-mobility group A2 expression level is associated with 
malignant phenotype iu pancreatic exocrine tissue. 

AbeN, Watanabe T . Suzulci V , Matsumoto N . Masaki T , Mori T , Sugiyama M . 
Chiappctta G , Fusco A , Atom! Y . 

First Department of Surgery, Kyorin University School of Medicine, 6-20-2, Shinkawa, 
Mitaka, Tokyo 181-861 1, Japan, abenbtg@kyoria-u.acjp 

The altered form of the high-mobility group A2 (HMGA2) gene is somehow related to 
the generation of human benign and malignant rumours of mesenchymal origin. 
However, only a few data on the expression of HMGA2 in malignant tumour originating 
from epithelial tissue are available. In this study, we examined the HMGA2 expression 
level in pancreatic carcinoma, and investigated whether alterations in the HMGA2 
expression level are associated with a malignant phenotype in pancreatic tissue. High- - 
mobility group A2 mRNA and protein expression was determined in eight surgically 
resected specimens of non-neoplastic tissue (six specimens of normal pancreatic tissue 
and two of chrome pancreatitis tissue) and 27 pancreatic carcinomas by highly sensitive 
reverse transcriptase-polymerase chain reaction (RT-PCR) techniques and 
immunohistochemical staining, respectively. Reverse transcriptase-polymerase chain 
reaction analysis revealed the expression of the HMGA2 gene in non-neoplastic 
pancreatic tissue, although its expression level was significantly lower than that in 
carcinoma, fmmunohistochemicai analysis indicated that the presence of the HMGA2 
gene in non-neoplastic pancreatic tissue observed in RT-PCR reflects its abundant 
expression in islet cells, together with its focal expression in duct epithelial cells. Intense 
and multifocal or diffuse HMGA2 immunoreactivity was noted in all the pancreatic 
carcinoma examined, A strong correlation between HMGA2 overexpression and the 
diagnosis of carcinoma was statistically verified. Based on these findings, we propose 
that an increased expression level of . the HMGA2 protein is closely associated - with the 
malignant phenotype in the pancreatic exocrine system, and accordingly, HMGA2 could 
serve as a potential diagnostic molecular marker for distinguishing pancreatic malignant 
cells from non-neoplastic pancreatic exocrine cells. 
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extracts. ]f these minor cell proteins differ among cells to the same extent as the 
more abundant proteins, as is commonly assumed, only a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization — different cell types often respond in dif- 
ferent ways to (he same extracellular signal. Underlying (his specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression ol different sets of 
genes. 

Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from ON A to RNA to Protein 4 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle he regulated. Finis a cell can control the proteins it 
makes by 11) controlling when and how often a given gene is transcribed (tran- 
scriptional control), {2) controlling how the primary - RNA transcript is spliced or 
otherwise processed (RNA processing control), (;J) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
trol). (4) selecting which rnRNAs in the cytoplasm are translated by ribosomes 
ttranslational control), (S) selectively destabilizing certain mRNA molecules in 
the cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure *J-2, only transcrip- 
tional control ensures that no superfluous intermediates are synthesized. In the 
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Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that oin.se different genes to he transcribed in different cells. Since many spe- 
cialized animal cells fan maintain their unit/lie character when grown in cult are, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
Ushedfind heritable when the cell divides, endowing the cell with a memory of its 
developmental history. l>wcaryotes and yeasts provide unusually accessible model 
systems in which to study gene, regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher ettcaryotes. One such mecha- 
nism involves a competitive interaction between two lor more) gene regulatory pro- 
teins, ench of which inhibits the synthesis of the other; this can create a flip-flop 
switch tlmt switches u cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetmile iheir mint synthesis, provide ft genera f mechanism for cell memory. 

In encaryotes gene transcription is generally controlled by combinations oj gene 
regidntory proteins. It is thought tlmt ench type of cell in a higher eunuyotit: organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell. A given gene regulatory pro- 
tein nuty be expressed in ft variety of circumstances find typically is involved in the 
regulation of many genes. 

In tnhhtion to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotk cells to regalate gene expression. In ver- 
tebrates DNA metliyltttion ftlso plays a part, mainly fts u device to reinforce decisions 
about gene c.\ pression that ore made initially by other mechanisms. 



Although iodiidK oh ilit: initiation ol i;enc I ranser ipiion are the predominant 
horn ol n -"illation for most e,enes. other controls can art later in the pathway 
linn) una io ptolrin lo modulate the amount of e;ene product that is made. Al- 
though du m' post transcriptional com nds, which operate at lei UNA polymerase 
ha\ hen mil io i he e.ene's promoter and hc^tm UNA synthesis, are less common 
than trunsetipiimttd control, lor many Irenes I hey :ue crucial. Il seems that every 
step in ;:eiie <-\picssion I hat could he eonti oiled in pnnciple is likely to he ■ emu- 
lated nndei some circumstances lor some c;enes. 

We eonsiifci the varieties Ol post transcriptional regulation in temporal or- 
dei. aeeordm;; to the sentience of events thai miejn he e.\pei ieneed hy an RNA 
molecule all ei "us I \ arise tip) ion has begun (Figure !>-7^]. 
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FROM DNA TO RNA 



Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and liny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change {or regulate) the expression of each of its 
genes according to the needs of the moment — most obviously by controlling 
the production of its RNA. 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell lakes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still wrinen in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

Uke DNA, RNA is a linear polymer made of four different types or nucleotide 
subunits linked together by phosphodi ester bonds (Figure 6-1). )t differs from 
DNA chemically in two respects: ()) the nucleotides in RNA are 
ribonucleotides— \ha\ is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cylosine (C), it contains. the base uracil (VJ) 
instead of the thymine (T) in DNA. Since U, like X can base- pair by hydrogen- 
bonding with A (Figure 6-5). the complementary base-pairing properties 
described for DNA in Chapters -I and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). li is not uncommon, however, to find other types of base 
pairs in RNA: foi example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the fin a J shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to iokl into com- 
plex three-dimensional shapes allows some RNA molecules to have stiuc Jural 
and catalytic functions. 

Transcription Produces RNA Complementary to 
One Strand of DNA 

AJI of the RNA in a cell is made by DNA iransei ipiion, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein in a eircaryotic cell 
depends upon the efficiency of each step 
depicted. 



ore 6-90) could be regulated by (he ceil for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point foi n cell to regulate the expression ol each of iis genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step — the transcription of its DNA sequence into an RNA 
molecule. 



Sumrnnry 

The translation of the nucleotide sequence of an mliNA molecule into protein takes 
place in the cytoplasm on a large rihonncleoproiein assembly called a ri bosom e. The 
amino acids used for protein synthesis are first a ft ached to a family of tHNA 
molecules, each of which recognizes, by complementary base- pair interactions, par- 
ticular sets of three nucleotides in the mliNA (an ions). The sequence of nucleotides in 
the mliNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small n bosom nl subunit binds to the in UNA molecule 
at a start cotton (AUG) that is recognized by a unique initiator iHNA molecule. A 
large ribosomal subunit bittds to complete the ribosonte ami begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tliNAs — each bearing a 
specific amino acid bind sequentially to the appropriate cod on in mliNA by forming 
complementary base pairs with the tHNA anticodon. Tach itmino acid is added to the 
C- terminal end of the growing polypeptide by means of n cycle of three setftiential 
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Figure 7-5 Six steps at which 
cucaryotic gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed in this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
inactivation by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible inactivation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell type's of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, 'there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (!) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), O) 
selecting which completed mRNAs in (he cell nucleus are exported to the cylosol 
and determining where in the cylosol I hey are localized (RNA transport and 
focaiization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (iranslational control), (5) selectively destabilizing "certain mRNA 
molecules in the cyloplasm (mRNA degradation control), or (6) selectively acti- 
vating,, inactivating, degrading, or compartmentalizing specific protein 
molecules after i hey have been made (protein activity control) ( figure 7-5). 

Kor most genes transcriptional controls are paramount. This makes "sense 
because, of all the possible control points illustrated in Figure 7- r >, only Iran- 
icripiional control ensures that the cell will noi synthesize superlluous interme- 
diates. In the following sections we discuss the DNA and protein components 
•nat perlorm this function by regulating the initiation of gene transcription. We 
>hall return al the end of the chapter to the additional ways of regulating gene 
repression. 



nummary 

Vhc genome of a cell contains in its DNA sapience the information to make ninny 
thousands of different protein ami UNA molecules. A cell typically expresses only a 
jraction of its genes, and the different types of cells in multicellular organisms arise 
.-.cause different sets of genes arc expressed. Moreover, cells can change the pattern 
;•■/ genes they express in response to changes in their environment, such as signals 
'•car other cells. Although all of the steps inetdrcd in expressing a gene can in prin- 
•.-.pie be regulated, for most genes the initiation of RNA transcription is the most 

•important point of control. 



; i\v does a cell determine which of iis thousands of genes to transcribe';' As 
--niiom-d briefly in Chapters -I and 6, the itansciiphon of each gene is. con- 
fled by a regulatory region of DNA ielaiively near the site where transcription 
;ms. .Some regulatoty regions ate simple anil act as switches that are thrown 

- :* single signal. Many others are complex and act as liny microprocessors. 

■ ponding to a variety of signals ihat they inierpiei and integrate to switch the 

■ :; ; : .hboiing gene on oi off. Whether complex or simple, these switching-devices 
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Selective apoptosis of natural killer-cell tumours by 1-asparaginase, 

Ando M , Sugimoto 1( , Kitott T , Sasaki M , Mukai K , Ando J , Egaslura M , Schuster 
SM , Oshimi K . 

Department of Haematology, Juntendo University School of Medicine, Tokyo, Japan. 

We examined the effectiveness of various anti -tumour agents to natural killer (NK)-ceil 
tumour cell lines and samples, which are generally resistant to chemotherapy, using flow 
cytometric terminal deoxy nucleotidyl trans ferase-mediated dUTP-biotin nick end- 
labelling (TUNEL) assay. Although NK-YS and NK-92 were highly resistant to various 
anti-tumour agents, l-asparaginase induced apoptosis in these two NfC-cell lines. NFC-cell 
leukaemia/lymphoma and acute lymphoblastic leukaemia (ALL) samples were . 
selectively sensitive to l-asparaginase and to doxorubicin (DXR) respectively. Samples of 
chronic NK lymphocytosis, an NFC-cell disorder with an indolent clinical course, were 
resistant to both drugs. Our study clearly separated two major categories of NK-cell 
disorders and ALL according to the sensitivity to DXR and l-asparaginase. We examined 
asparagine synthetase levels by real-time quantitative polymerase chain reaction (RQ- 
PCR) and immunostaining in these samples. At least in nasal-type NK-cell lymphoma, 
there was a good correlation among asparagine synthetase expression, in vitro sensitivity 
and clinical response to l-asparaginase. In aggressive NK-cell leukaemia, although 
asparagine synthetase expression was high at both mRNA and protein levels, I- 
asparaginase induced considerable apoptosis. Furthermore, samples of each disease entity 
occupied a distinct area in two-dimensional plotting with asparagine synthetase mRNA 
level (RQ-PCR) and in vilrol-asparaginase sensitivity (TUNEL assay). We confirmed 
rather specific anti-tumour activity of l-asparaginase against NK-cell tumours in vitro, 
which provides an experimental background to the clinical use of l-asparaginase for NK- 
cell tumours. 
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Human thyroid carcinoma cell lines and normal thyrocytes: expression 
and regulation of matrix metalloproteinase-l and tissue matrix 
metaUoproteinase inhibitor-1 messenger-RNA and protein. 

AustG, Hofmann A, Laue S/ Rost A T Kohler T , Scherbaum WA . 

Institut of Anatomy, University of Leipzig, Germany. 

Matrix metaUoproteinase- 1 (MMP-1 ) and tissue matrix metaUoproteinase inhibitor 1 
(TIM?- 1) play an important role in remodeling the extracellular matrix in normal and 
. pathological processes. The effect of phorbol-myristate acetate (PMA), interleukin-l (IL- 
.1), and tumor necrosis factor-alpha (TNF-alpha) on MMP-1 and TMP-I expression was 
studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 
1736 thyroid carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA 
(mRNA) levels were monitored by competitive semiquantitative reverse transcriptase 
polymerase chain reaction (RT-PCR) after 24 hours. Culture supernatants were assayed 
for free and/or complexed MMP-1 and TIMP-1 after 48 hours using enzyme-linked 
immunosorbent assay (ELISA) systems (detection limit; <2 ng/mL). MMP-1 and TIMP-1 
mRNA were present in all cell types, although thyrocytes showed MMP-1 mRNA levels 
near the detection limit. 8505 C expressed MMP-1 mRNA levels of up to 10(6) times 
those of the other cells analyzed. PMA and IL-1 increased MMP-1 mRNA in most cell 
types. TIMP-1 mRNA increased after treatment with PMA in all cells except 8505 C, 
whereas only slight effects were shown after IL-1 stimulation. MMP-1 protein was 
undetectable in normal thyrocyte cultures, but was secreted spontaneously by all cell 
lines ([ng/mL]; C 643: 15+/-7; HTh 74: 81+/-1; SW 1736: 13+/-2; 8505 C: 2097+/-320). 
There was a strong correlation between levels of MMP-1 mRN A and protein (r = 0.99, p 
< .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. 
Fibroblasts ([ng/mL] 5 17+/- 5 5) and the cell lines (C 643: 142+/-48; HTh 74: 1 15+/-13; 
SW 1736: 202+/- 1 4; 8505C: 120+/-19) secreted TIMP-1 in unstimulated cultures, 
whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated TIMP-1 secretion after 48 hours in SW 1736, HTh 74, and C 
643 cells. Our data suggest that in contrast to normal thyrocytes, dedifferentiated thyroid 
carcinoma cell lines are potential producers of MMP-1 as well as TIMP-1 . High MMP-1 
or MMP-1 /TIMP-1 expression may play a role in tissue invasion of undifferentiated 
thyroid cancer cells. 
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Thyrocytes: Expression and Regulation of Matrix 
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ABSTRACT 

Matrix metalloproteinase-1 (MMP-1) and tissue matrix metalloproteinase inhibitor 1 (T1MP-1) play an impor 
tant role in remodeling the extracellular matrix in norma] and pathological processes. The effect of phorbol-myris- 
tate acetate (PMA), interleukin-1 (H-l), and rumor necrosis factor-a (TNF-a) on MMP-1 and T1MP-1 expres- 
sion was studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 1736 thyroid 
carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA (mRNA) levels were monitored by 
competitive semiquantitative reverse transcriptase polymerase chain reaction (RT-PCR) after 24 hours. Culture 
supernatants were assayed for free and/or complexed MMP-1 and T1MP-1 after 48 hours using enzyme-linked 
immunosorbent assay (EUSA) systems (detection limit; <2 ng/mL). MMP-1 and T1MP-1 mRNA were present in 
all cell types, although thyrocytes showed MMP-1 mRNA levels near the detection limit. 8505 C expressed MMJP- 
1 mRNA levels of up to 10 6 rimes those of the other ceDs analyzed. PMA and H-l increased MMP-1 mRNA in 
most cell types. T1MP-1 mRNA increased after treatment with PMA in all cells except 8505 C, whereas only 
slight effects were shown after EL-1 stimulation. MMP-1 protein was undetectable in normal thyrocyte cultures, 
but was secreted spontaneously by all cell lines ([ng/mL]; C 643: 15 ± 7; HTh 74: 81 ± 1; SW 1736: 13 ± 2; 
8505C; 2097 ± 320). There was a strong correlation between levels of MMP-1 mRNA and protein (r = 0.99, 
p < .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. Fibroblasts ([ng/mL] 517 ± 
55) and the cell lines {C 643: 142 ± 48; HTh 74: 115 ± 13; SW 1736: 202 ± 14; 8505 C: 120 ± 19) secreted 
T1MP-1 in unstimulated cultures, whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated T1MP-1 secretion after 48 hours in SW 1736, HTh 74, and C 643 cells. Our data 
suggest that in contrast to normal thyrocytes, dedifferentiated thyroid carcinoma cell lines are potential produc- 
ers of MMP-1 as well as TIMP-1. High MMP-1 or MMP-l/TIMP-1 expression may play a role in tissue inva- 
sion of undifferentiated thyroid cancer cells. 



INTRODUCTION suppressed on external stimulation, ie, with cytokines, 

Mphorbol 12-myristate 13-acetate (PMA), lipopolysaccaride 

ATRrx metalloproteinases, (MMPs) constitute a (LPS), or retinoic acid (2,3). After secretion at post-tran- 

ui r ° f SlruCturaI,y reIatcd P rote °lytic enzymes re- scriptional level, latent MMP proenzymes are regulated by 

sponsible for the proteolytic degradation of extracellular proteolytic activation and interaction with tissue inhibitors 

matrix (ECM) components. They are important partici- of matrix metalloproteinase (TlMPs), their specific in- 

pants in normal tissue remodeling and contribute to the hibitors. Any imbalance berween the proteolytic MMPs ao 

phenorype of several pathological conditions that arc as^ tivities and the TlMPs that could be influenced and caused 

sociated with progressive ECM degradation. MMPs are by cytokines could potentially lead to pathological condi- 

highly regulated at different levels (1). At the transcrip- tions (4). 

tional level, MMP expression can be directly induced or MMP-1, although known as an interstitial collagenase, 
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is the only enzyme active at neutral pH thai can degrade 
extracellular fibers comprised of collagen types I, 11, and 
III. With this initial step, MMP-1 provides the cleavage 
products to other collagenase types (5). The major specific 
inhibitor of MMP-1 is TIMP-1, a 2 8.5- led glycoprotein, 
which forms 1:1 stoichiometric complexes with the pro- 
tease (6). Cytokines and growth factors have been shown 
to' regulate -the expression of both MMP-1 and T1MP-1 
(1,7,8). 

Although the participation of MMP-1 as the initial col- 
lagenase in tissue breakdown during tumor development is 
well documented (9-11), only one study has described the 
expression (12) but no study has as yet investigated the reg-: 
ulation of this enzyme in different thyroid tumors. Few stud- 
ies have been published investigating the role of other 
MMPs in normal and pathological thyroid tissue by in situ 
hybridization and immunohistochemistry (13-16). Fur- 
thermore, tissue remodeling includes both the action of 
MMPs and their inhibitors; thus, these enzymes could be 
involved in autoimmune and other nonautoimmune thyroid 
diseases during morphological changes (17,18). It is still un- 
known whether or not thyrocytes are able to express MMPs 
and TlMPs. Although type IV collagenases (MMP-2 and 
MMP-9) were detected in various human epithelial cells of 
different tissue origin (19,20), only one study described the 
secretion of MMP-1 by epithelial cells (21). 

Highly purified normal thyrocytes and four thyroid car- 
cinoma cell lines were included in this study to investigate 
the involvement of these cells in MMP-1 and T1MP-1 pro- 
duction during thyroid tissue remodeling processes and in 
malignant thyroid neoplasms. MMP-1 and TIMP-1 ex- 
pression were studied at both the mRNA and protein level 
by semiquantitative RT-PCR and ELISA measurement, re- 
spectively. 

In unstimulated carcinoma cell lines both MMP-1 and 
TIMP-1 mRNA were expressed, partly at a high level, fol- 
lowed by the spontaneous secretion of the proteins. The 
various conditions for the. stimulation of the different cell 
lines by cytokines and PMA were defined. In contrast to 
the cell lines, normal thyrocytes did not secrete MMP-1 
and only trace amount of TIMP-1, even after stimulation 
with PMA. 



MATERIALS AND METHODS 

Preparation of tissues, thyroid-derived cells, and 
cell lines 

Thyrocytes were prepared from surgical thyroid speci- 
mens from 3 patients (1 Graves' disease, 2 nontoxic goi- 
ter; mean age 543 i 5.0 years). Fibroblasts were separated 
from thyroid tissue of 5 other patients (3 Graves' disease, 
2 nontoxic goiter; mean age 43.6 ± 6.4 years). Graves' dis- 
ease and nontoxic goiter were diagnosed on the strength 
of clinical, biochemical, and immunologic features as well 
as thyroid scintiscans. 

Thyroid tissue was trimmed of fat and connective tissue 
immediately after surgery. Thyroid-derived cells were en- 
riched aher gradual enzymatic digestion of tissue and cul- 
tured over a period of 16 hours as described. Thyrocytes 
were obtained from the adherent fraction by incubating 
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the cell monolayer with phosphate buffered saline (PBS) 
without Ca 2 ~/Mg 2+ for 45 minutes (22). Residual fibro- 
blasts were removed after subsequent incubation of the 
cells with the fibroblast-specific mab FibASOl (22) and 
goat-anti-mouse IgG-DYNABEADS® M450 (DYNAL, 
Hamburg, Germany) according to the manufacturer's pro- 
tocol. 

Thyroid-derived fibroblasts were obtained after culturing 
small pieces of thyroid tissue in Dulbeccos's Modified Ea- 
gle's Medium (DMEM) with 10% fetal calf serum (FCS) 
and harvested in the 5th to 7th passage. The purity of the 
thyrocytes and fibroblasts was determined by using indirect 
immunofluorescence technique on a FACS-Scan (Becton 
Dickinson GmbH, Heidelberg, Germany) as described (22). 

The following human anaplastic thyroid carcinoma eel! 
lines were cultured in DMEM with 10% FCS: C 643 (23); 
SW 1736 (23); and HTh 74 (24). The cell line 8505 C (25) 
was purchased from the German Collection of Microorr 
ganisms and Animal Cell Cultures (DSM ACC219). This 
cell line was established from a primary thyroid tumor 
characterized histologically as a undifferentiated carcinoma 
that was partially composed of poorly differentiated pap- 
illary cells (25). This is a feature of a subgroup of anaplas- 
tic carcinoma (26). The majority of these coexistent better 
differentiated carcinoma foci in anaplastic carcinoma were 
papillary (26). 

In vitro cultures 

Using 24-welI plates, 1 X 10 5 cells were cultured for 24 
hours. The medium was aspirated and replaced with 500 
ML OPTI-MEM (G1BCO BRL, Grand Island, NY) without 
FCS to eliminate possible stimulation of MMP-1 arid 
TIMP-1 production by FCS. The medium contained the 
desired concentration of human 1L-1 or (10 U/ml; Pepro 
Tech EC Ltd., London, UK), TNF-a (100 U/mL; Pepro 
Tech EC Ltd.), interferon-y (IFN-y) (500 U/mL; Pepro 
Tech EC Ltd.), or 10 ng/mL PMA (SIGMA). 

Triplicate cultures of each stimulator were analyzed af- 
ter 3, 6, and 24 hours at the mRNA and after 24 and 48 
hours at the protein level. The supernatants were removed 
and stored at -80°C for further use. First,. a collagenolytic 
assay based on the digestion of type I collagen was per- 
formed. This method showed direct evidence of free pro- 
MMP-1 enzyme in the cell culture supernatants of un- 
stimulated and 1L-1 or stimulated 8505 C, HTh 74, and 
C634 cells (data not shown). However, the method does 
not allow quantitation of MMP-1 enzyme activity. Thus, 
the cell culture supernatants were assayed for MMP-1, 
TIMP-1, and MMPl /TIMP-1 complex by ELISA (Amer- 
sham Life Sciences, Braunschweig, Germany). The MMP- 
1 assay {sensitiviry: 1.7 ng/mL) detected only total human 
MMP-1, ie, free MMP-1 and MMP-1 complexed with in- 
hibitors such as TIMP-1. It did not detect MMP-1 bound 
by the nonspecific protease inhibitor a 2 -macrogIobulin. 
The MMP-I/TLMP-1 assay (sensitivity: 1.5 ng/mL) de- 
tected MMP-1 /TIMP-1 complex, ie, activated MMP-1 that 
has been subsequently complexed with the specific MMP- 
1 inhibitor TIMP-1. It did not detect free actrve MMP-1, 
free TIMP-1, or pro-MMP-1. There was no cross-reactiv- 
ity with active MMP-1 bound by the nonspecific protease 
inhibitor a?-macroglobulin. The TIMP-1 assay (sensitivity: 
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1.25 ng/mL) detected total human TIMP-1, ie, free TIMP- 
1 and that complexed with MMPs. The assay did fully 
cross-react with TIMP-1 in complexes with other MMP. 
It did not cross-react with TIMP-2. 

RNA extraction and cDNA synthesis 

For gene expression studies, 5 mL RNAzol™ B (Biotex 
Laboratories Inc., Houston, TX) was added to the cell cul- 
ture wells. The content of three wells of any cell type was 
pooled and then stored frozen for further mRNA analysis 
in liquid nitrogen. Total cellular RNA (cDNA) was iso- 
lated from the probes according to the manufacturer's pro- 
tocol. RNA was fractionated on a denaturing 1.0% agarose 
gel and stained with ethidium bromide to confirm that 
spectrophotometric measurements were accurate arid that 
the RNA had not been degraded. Five micrograms total 
RNA was reverse-transcribed to cDNA using the First- 
strand cDNA synthesis kit of Pharmacia (Uppsala, Swe- 
den), in a total reaction volume of 15 pL. 

mRNA analysis by competitive RT-PCR 

To correct for variations across different cDNA prepa- 
rationsi all samples were first adjusted to contain equal in- 
put gIyceralderyde-3-phosphate dehydrogenase (GAPDH) 
cDNA concentrations. Semi-quantitative GAPDH RT-PCR 
was used with a heterologeous synthetic competitor frag- 
ment. The generation of the specific PCR products from 
the competitor and the cDNA with the GAPDH primers 
were published earlier (22,27). 

We then estimated the MMP-1 and TUvlP-1 cDNA in 
these adjusted samples. The primers were selected using the 
DNAsis computer program (Hitachi Software Engineering 
Co, Yokohama, Japan). The primer pairs span one or more 
introns to allow unambiguous discrimination between 
cDNA and unwelcome contaminating genomic DNA. In 
quantitating MMP-1 and TTMP-1 cDNA, a rapid one-step 
method was introduced to synthesize an internal horriolo- 
geous competitor (plan diagram of procedure: Fig. 1, ex- 
emplary for MMP-1 [28J. A hybrid primer was synthesized 
{MMP-lhy) that consisted of two segments (seg], seg2). It 
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FIG. 1. Genera) scheme lor generating homologous com- 
petitors used for quantitative PCR. 
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had a length of 40 nucleotides, in which 20 nucleotides (seg,) 
at the 3' end corresponded to the opposite strand of the tar- 
get sequence a predetermined distance from primer MMP- 
If* and 20 nucleotides at the 5' end (seg* = MMP-lr) that 
corresponded to the target sequence upstream from the seg- 
ment segi. Amplification with the primers MMP-lf and 
MMP-lhy from the cDNA resulted in a 478-base pair (bp) 
(polymerase chain reaction (PCR) product. It was freed from 
excess primers and deoxynuclepside-triphosphates (dNTPs) 
using the Qiaquick Gel Extraction -.Kit (Qiagen GmbH, 
Hilden, Germany) and quantified. A known number of 
copies of the competitor was introduced in the GAPDH-ad- 
justed samples and amplified with the primers MMP-lf and 
MMP-lr. With this approach, two products were generated, 
one derived from the cDNA (560 bp) and another, 82 bp 
smaller in size derived from the interna) competitor (Fig. 1). 
PCR products were resolved by gel electrophoresis (1.5% 
agarose gel). The relative amounts of sample cDNA and 
competitor were quantified by measuring the intensity of 
ethidium fluorescence with a CCD image sensor and ana- 
lyzing the data with the EASY program (Herolab, Wiesloch, 
Germany). The initial amounts of sample cDNA and com- 
petitor were assumed to be equal in those reactions where 
the ratio of the two products was judged to be equal. This 
was expressed in arbitrary units (AU) (22,29). One AU was 
defined as the lowest concentration of competitor yielding 
a detectable amplification product when added to PCR 
alone. For example, if equivalence between sample cDNA 
and competitor was reached using a 100-fold concentrated 
competitor the relative sample cDNA concentration was 100 
AU. Thyrocytes and the ceil lines were analyzed for the ex- 
pression of thyroid-specific and cytokine receptor mRNAs 
in a simple RT-PCR. The sequences of the TPO and cy- 
tokine receptor primer pairs have been published by Wat- 
son et al. (30) and Tada et al. (31) and gave the following 
product sizes: TPO: 506 bp; IL-1R type I (p80):300 bp; 1L- 
1R type H (p68): 392 bp; TNF-aR (p75): 324 bp; TNF-aR. 
(p55): 587 bp and IFN-yR: 899 bp. The thyroglobulin (Tg) 
and thyroid stimulating hormone receptor (TSH-R) primer 
pairs were selected according to the published sequences us- 
ing the DNAsis program (Table 1). 

Each 25-/iL amplification reaction contained 2.5 /iJL 
10 X concentrated PCR buffer (15 mM MgCl 2> Boehringer 
Mannheim, Germany), 0.3 U Taq DNA polymerase 
(Boehringer Mannheim, Germany), 100 /xM dNTPs 
(Perkin Elmer, Weiterstadt, Germany), 0.1 /iM of each 
primer (1MB, Jena, Germany), and 1 /jJL cDNA and com- 
petitor in adjusted dilution. Furthermore, restriction map- 
ping (restriction enzymes: Boehringer Mannheim GmbH, 
Germany) was carried out to confirm the originality of the 
PCR product (Fig. 1, Table 1). 

Statistics 

Protein levels of ihyrocyte or fibroblast cultures from the 
different patients and of the thyroid carcinoma cell lines ob- 
tained from three separate experiments were presented as 
mean ± SEM values. Statistical comparisons between un- 
stimulated and stimulated cell cultures were performed by 
the alternate (Welch) t-tesr The correlation between basal 
mRNA levels and the unstimulated protein secretion in all 
cell types was calculated according to the Spearman method. 
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Table J. Primers, Length of Amplified Tempuvtes, Restriction Mapping and Assay Conditions for RT-PCR 







Primer 




♦ / ^noth of 


/ en&th of 


Annealing 


Numhrr rxf 






y- 


-r 


cDNA (bp) 


competitor (bp) 


temperature 


cycles 


Tg 


forward 


GCAGATCTTACTGAGTGGCr 




416 




«>,. 


35 


reverse 


TGTCAGCACAGTGGCAATAC 












TSH-R 


forward 


ACTTGCTGCAGCTGGTGCT 




354 




65 


35 


exons 1-4 


reverse 


TGAGGGCATCAGGGTCTATG 












TSH-R 


forward 


GAAATTCGGAATACCAGGAACTTA ACT 


890 




53 


35 


exons 4-10 


reverse 


AACTCATCGGACTTGGGGGTACA 












MMP-1 


forward 
reverse 
hybrid 


TGGGAGCAAACACATCTGAC 
ATCACTTCTCCCCGAATCGT 
ATCACTTCTCCCCGAATCGT 
CCATATATGGGTTGGATGeC 




560 


478 


64 


33 


TIMP-1 


forward 
reverse 
hybrid 


CTTAGGGGATGCCGCTGACA 
GGCAGGCAGGCAAGGTGACG 
GGCAGGCAGGCAAGGTGACG 
GGATGGATAAACAGGGAAAC 




351 


* 274. 


64 


30 



Tg indicates thyroglobuline; TSH-R, thyroid stimulating hormone receptor, MMP-1, matrix raetalloprottinase-I; 1TMP : 1, tissue inhibitor of 
metalloproteinasc-l; bp, base pair. 



RESULTS 

Thyroid specific and cytokine receptor 
mRNA expression 

Isolated thyrocytes as well as 8505 C cells expressed Tg 
•and thyroperoxidase (TPO) mRNA, whereas transcripts of 
the TSH-R (exons 1-4, 354 bp, exons 4-10, 896 bp) were 
present only in the thyrocytes. The three anaplastic thy- 
roid carcinoma cell lines SW 1736^ C 634, and HTh 74 
were completely negative for the Tg, TPO, and TSH-R mR- 
NAs (Fig. 2). All cell lines and thyrocytes expressed 1L-1R 
(type I and type H), TNF-aR (p75 and p55) and IFN-yR 
mRNA (Fig. 2). 

Basal MMP-1 and TJMP1 mRNA and ' ' 
protein expression 

In most stimulation experiments, mRNA levels did not 
increase until 24 hours of incubation. The 24-hour mRNA 
levels are shown in Figures 3 and 4. The 3- and 6-hour 
levels are demonstrated in those experiments where the 
mRNA levels reached their peak before 24 hours of stim- 
ulation. If not otherwise indicated, the MMP-1 levels were 
measured using the ELISA system, which recognizes 
free/complexed MMP-1. 

MMP-1 and TIMP-1 mRNA were found during un- 
stimulated culture in all investigated cell types, although 
the mRNA levels varied over a great range. 8505 C showed 
a basal MMP-1 mRNA level 20 times as high as those of 
the HTh 74 cells, 6 x \0 A times as high as C 643, and 2 X 
10 6 times as high as SW J 736 cells. In thyrocytes, MMP- 
I mRNA levels were found near the detection limit (Figs. 
3 and 4). 

Generally, when analyzing the noted cell types, the mea- 
sured basal MMP-1 or TIMP-1 mRNA levels correlated 
well with the basal protein expression (MMP-1: r = 0.99, 
p < .0001;T1MP-1: r = 0.98, p < .002). Corresponding to 
the high MMP-1 mRNA level, 8505 C cells secreted ex- . 
ircmely high levels of MMP-1. No MMP-1 or TIMP-1 was 



detected in unstimulated thyrocyte cultures at any time- 
point examined. All other cell types showed a spontaneous 
MMP-1 and TIMP-1 secretion (Figs. 5 and 6). Thyroid- 
derived fibroblasts produced basal TIMP-1 levels of up to 
4 times higher in the four carcinoma cell lines, which se- 
creted nearly the same amounts of basal TIMP-1 protein. 
Nevertheless, TIMP-1 secretion of fibroblasts was found 
at lower levels than expected after TTMP-1 mRNA mea- 
surement in 4 of 5 analyzed patients. The results of the fi- 
broblast cultures from patient five showing a higher TTMP- 
1 expression than those from the 4 other patients (basal 
24 hour: 50 ± 2; PMA 24 hour: 90 ± 6 ng/mL TTMP-1) 
was omitted in Figure 6. 

Comparing the basal amount of free/complexed and 
TIMP-1 completed MMP-1 after 24 hours of stimulation, 
a significant level of MMP-1 was not cornplexed with 
TIMP-1 in 8505 C cultures, whereas in fibroblast cultures 
most of the MMP-1 activity was inhibited by TTMP-1. The 
anaplastic carcinoma cell line HTh 74 did not show such 
a great discrepancy between free/complexed and TTMP-1 
cornplexed MMP-1 level as 8505 C cells (Fig. 7). 

Effects of IL-la on MMP-1 and TIMP-1 mRNA and 
protein expression 

Experiments were performed to determine whether hu- 
man thyroid epithelial cells and thyroid carcinoma cell lines 
could produce or increase basal MMP-1 and T1MPT se- 
cretion after exposure to various stimuli. The results from 
these stimulation experiments are summarized in Figures 
3 through 5. Generally, there was a delay in protein se- 
cretion level in comparison to the mRNA expression level. 
At the protein level, the cytokine-mediated stimulating or 
inhibiting effect is more distinct after 48 hours compared 
with 24 hours, even when the mRNA level had already de- 
creased after 6 hours. 

1L-1 upregulated MMP-1 mRNA in SW 1736-cells up 
to 100 times and, in thyroid-derived fibroblasts, up to 12 
times after 24 hours of incubation (Fig. 3). This increased 
mRNA level was accompanied by a significantly enhanced 
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FIG. 2. Amplification of thyroid specific and imerJeukin-receptor mRNA in thyrocytes (1), SW 1736 (2), C 643 (3), HTh 
74 (4) and 8505 C (5) cells using RT-PCR; M r = 100-bp ladder (GIBCO). 



MMP-1 secretion after 48 hours. Furthermore, IL-ler in- 
creased MMP-1 mRNA expression in thyrocytes up to 
seven times after 6 hours, but no MMP-1 protein could be 
detected in thyrocyte cultures. IL-1 had no stimulatory ef- 
fect on MMP-1 mRNA expression in C 643, HTh 74, and 
8505 C cells after 24 hours, although a significant increase 
of MMP-1 secretion was found in HTh-74 and SW 1736 
ceils after 48 hours of incubation (Fig. 5). This discrepancy 
may be explained by a possible increase in MMP-1 mRNA 
level after 24 hours of stimulation. The same effect could 
also be observed in the IL-1 stimulated T1MP-1 at the 
mRNA as well as the protein level: the only slight effect 
of IL-1 on TIMP-1 mRNA expression in carcinoma cell 
lines after 24 hours was accompanied by a significant in- 
crease of TIMP-1 secretion in 8505 C and HTh 74 cells 
after 48 hours (Figs. 4 and 6). 

Effects ofTNF-a on both MMP-1/T1MP-1 mRNA 
and protein expression 

In contrast to IL-1, "TNF-or did not stimulate the MMP- 
] and TIMP-1 mRNA and protein levels in all carcinoma 
cell lines and thyrocytes. Only thyroid-derived fibroblasts 
responded with a slight upregulation of MMP-1 and TIMP- 
1 mRNA expression after TNF-a stimulation, which was 
not accompanied by an increase of MMP-1 and TIMP-1 
secretion. 



Effects of PMA, and IFN-y on MMP-1 and TIMP-1 
mRNA and protein expression 

PMA was included in our study as a positive control 
because it is known to upregulateor induce both MMP- 
1 and TIMP-1 secretion in various cell types (1,32). In- 
deed, PMA was able to induce or enhance MMP-1 
mRNA levels in all cell types investigated, although the 
detected levels varied to a large extent (Fig. 3). This re- . 
suit is in good correlation with the significantly increased 
MMP-1 protein levels that were already detectable after 
24 hours of stimulation {Fig. 5). PMA upregulated TIMP- 

1 mRNA levels by up to 20 times in C 643, and up to 

2 times in SW 1736 and HTh 74 cells, fibroblasts and 
thyrocytes, but it did not change the TIMP-1 mRNA con- 
tent in 8505 cells (Fig. 4). At the protein level, we found 
a significant stimulation of TIMP-1 secretion in C 643 
and HTh 74 cells, as well as in thyroid-derived fibrob- 
lasts (Fig. 6). 

ln contrast to PMA, lFN--y was without effect on stim- 
ulation or downregulation of MMP-1 and TIMP-1 mRNA 
or protein in anv of the cell types investigated (Figs 5 and 
6). 

The main inhibitor of MMP-1 is TIMP-1, which forms 
1:1 stoichiometric complexes with MMP-1, although some 
other inhibitors can also bind MMP-1. On the other hand, 
TIMP-1 can bind other MMP types. 
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FIG. 3. Representative samples of competitive amplified MMP-1 mRNA of thyrocytes, thyroid-derived fibroblasts and thy- 
roid carcinoma cell lines without stimulation (control) and after stimulation with 10 U/mL IL-lot and 100 U/mL TNF-a and 
10 ng/mL PMA after 24 hours. Serial dilutions of known amounts of the competitor fragment were coamplified with identi- 
cal aliquots of cDNA.The 560-bp (cDNA) and 478-bp (competitor) PCR products were visualized by agarose gel electrophoresis 
and ethidiumbromide staining. The relative concentration of the added competitor was given in arbitrary units (AU) in the 
figure. One AU was defined as the lowest. concentration of the competitor yielding a detectable amplification for MMP-1 
mRNA. The ratio of competitor to cDNA fragments was determined by measuring the intensity of ethidium fluorescence with 
a CCD image sensor and analysis of data. Measured cDNA concentration can expressed in AU. 




DISCUSSION 

Our findings demonstrate for the first time thai thyroid 
carcinoma cell lines are able to express MMP-1 and ThMP- 
1 mRNA and protein at significant levels in vitro. The ob- 
servation of spontaneous release of MMP-1 and T1MP-1 
corresponds well with earlier studies covering the secretion 
of these proteins by several carcinoma cell lines (33,34). 

However, in contrast to its clear physiological function 
in extracellular matrix breakdown, the role of MMP-1 in 
tumor growth and metastases is still controversial 
(9-11,35). Recently, Murray et al. (JO) demonstrated that 
MMP-1 is associated with poor prognosis in colorectal can- 
cer, and has a prognostic value independent of the Dukes 
stage. Therefore, MMP-1 could be a target for therapeu- 
tic intervention in such rumors. Furthermore, the hypoth- 
esis of whether or not cancer cells Themselves are able to 



produce MMP, or whether cancer celts stimulate the sur- 
rounding stromal cells to secrete MMP/n vivo, is disputed. 
MMP-1 mRNA and protein were detected by both in situ 
hybridization and immunohistochemistry in stromal as 
well as tumor cells of head, neck, gastric, colorectal, and 
mammary carcinomas (9,10,36,37). In contrast, 
Kameyama (12) demonstrated by in situ hybridization that 
the MMP-1 mRNA was not expressed in the cancer cells 
bur in the surrounding fibrous capsules of strongly differ- 
entiated papillary thyroid carcinoma tissue. Highly differ- 
entiated follicular carcinomas and follicular adenomas 
were depleted for MMP- 1 transcripts. Undifferentiated fol- 
licular^ papillary, and aggressive anaplastic carcinomas 
that showed poor prognosis and strong tumor invasive and 
metastatic potential and that can be compared in their 
morphological, genetic and growth features with undiffer- 
entiated rhyroid carcinoma cell lines were nor included rhis 
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FIG 4. Compcritive TIMP-1 mRNA RT-PCR yielding a 351-bp (cDNA) and a 274-bp (competitor) PCR product. For fur- 
ther details see Figure 3. \ ' . - 



study. However, the missing expression of MMP-1 by nor- 
mal thyrocytes and the spontaneous secretion of this pro- 
tein by highly malignant thyroid carcinoma cell lines, as 
demonstrated in our study, indicate the involvement of 
MMP-1 secretion of transformed thyrocytes in aggressive 
thyroid tumors. 

Although all cell lines analyzed in our study sponta- 
neously secreted MMP-1, we observed marked differences 
in the basal secretion capacity. The highest MMP-1 levels 
were determined in cultures of 8505 C cells. Only 8505 C 
cells expressed TPO and Tg mRNA that may be put down 
to residual differentiated components in the cell line (see 
Materials). However, none of the analyzed cell lines ex- 
pressed TSH-R mRNA. The cell population doubling times 
were less than 40 hours. All cell lines had accumulations 
of multiple genetic events. These facts indicate the undif- 
ferentiated pathology of the studied lines. It is well known 
that anaplastic carcinoma cell lines well retain the malig- 
nant characterises of their parental tumors (38^10). 

Furthermore, we found a distorted proportion between 
MMP-1 and TIMP-1 mRNA/proiein for carcinoma cell 
lines but not for normal thyroid-derived fibroblasts. The 
most disadvantageous constellation between MMP-1 and 
TUV1P-1 was found in 8505 C cells. Similar to other stud- 
ies (41), these results suggest the influence of an altered 
MMP/TIMP relation on tumor progression. However, it 



should be mentioned that most studies, including the pre- 
sent one, do nor take into consideration that a number of 
inhibitors distinct from TIMP-1 may regulate MMP-1 ac- 
tivity. Taking into account that the balance of active en- 
zyme and TIMP-1 concentration strongly influence the ex- 
tent of local matrix degradation, a number of studies 
showed unexpectedly high levels of TIMP-1 in malignant 
neoplasms (9,42,43). There is a great discussion as to 
whether the overall expression of MMP-1 and TIMP-1 or 
the amount of noncomplexed MMP-1 could be critical in 
aggressive tumor development. This fact underlines the na- 
ture of tissue breakdown, reflecting the complicated net- 
work of selective and coordinated production of individ- 
ual proteinases . and inhibitors under normal and 
pathophysiological conditions. Thus, the invasive and 
metastatic potential of thyroid tumors depends on the lo- 
cal net level of active MMPs. 

The synthesis of MMP-1 and TIMP-1 is influenced by a 
variety of biochemical stimuli. The recent findings on 
MMP-1 and TIMP-1 gene promoters are useful in under- 
standing the complex mechanisms implied in the regula- 
tion of MMP synthesis modulated by cytokines an,d tumor 
promoters (34/14,45). The promoter regions contain tu- 
mor promoter responsive elements (TRE) and binding mo- 
tifs for the transcription factor PEA-3, which are recog- 
nized by proto-oncogenic transcription factors T such as the 
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FIG. 7. Comparison between (i) free/complex and (ii) T1MP- 
1 completed MMP-1 levels, and (in) TIMP-1 levels in super- 
natants of unstimulated 8505 C and HTh 74 cells, and . thy- 
roid-derived fibroblasts after 24 hours using a (i) MMP-1 
ELISA that recognizes total MMP-1 (see Figure. 5). The (ii) 
MMP-1/T1MP-1 assay recognizes MMP- 1/I1MP- 1 complexes, 
ie, activated MMP-1 that has subsequently been complexed 
with the specific MMP inhibitor TIMP-1. The (hi) TIMP-1 
ELISA recognizes total TIMP-1, ie, free TIMP-1 and that com- 
plexed with MMPs. 



fos and jun family (45-47). IL-3, TNF-a, and PMA De- 
regulate proto-oncogenes like fos and fun, resulting in the 
stimulation of MMP-1 and TIMP-1 (45,48). The action of 
the cytokines is mediated by their specific receptors. In our 
study, 1L-1R (type 1 and type II), TNF-aR.{p75 and p55) 
and IFN-yR mRNAs were demonstrated in all investigated 
cell types. PMA and IL-1 were shown to elevate MMP-1 
and TIMP-1 in nearly all cell types investigated, thus con- 
firming the results of several studies on other epithelial cells 
(reviewed in refs. 1,7,49). In :?he majority of experiments, 
we found a concordant expression of MMP-1 and T1MP- 
1 after stimulation, possibly achieved by the coordinated 
actions of the nuclear transcription factors, although MMP- 
1 and TIMP-1 expression can also be independently or even 
reciprocally regulated (I). The effect of TNF-a was not as 
distinct as in the case of PMA and IL-1, although several 
investigators found a pronounced effect of TNF-a particu- 
larly on TUV1P-] secretion (4,34). In contrast to studies per- 
formed with other cell types (863,864,819), IFN-ydid not 
influence MMP-1 and TIMP-1 expression in thyroid carci- 
noma cell lines. In summary, ihe involvement of the in- 
trathyroidal physiological and pathological cytokine mi- 
croenvironment in the regulation of MMP-1 and TiMP-l 
induction activation and inhibition is strongly suggested. 

Furthermore, the data demonstrate that regular human 
thyrocyres did not produce MMP-1, even after powerful 
stimulation with PMA. Investigating other mammalian ep- 
ithelial cells, only one study revealed the production of 



MMP-1 by rabbit corneal cells (21). It is yet not clear 
whether the MMP-1 mRNA detected in thyrocytes is due 
to a low level of constitutive transcription of the MMP-1 
gene (illegitimate transcription), an existing pool of stable 
MMP-1 mRNA, or in vitro induction of MMP-1 mRNA. 
But it seems more likely that residual fibroblasts contained 
in the purified thyrocyte preparation (<0.2%) are respon- 
sible for the slightly positive RT-PCR results. Another ex- 
planation could be that thyrocytes are indeed MMP-1 pro- 
ducers, but the ELISA detection system used was not 
sensitive enough to measure extremely low MMP-1 secre- 
tion levels. Furthermore, the discrepancy between elevated 
TIMP-1 mRNA levels pf thyrocytes and the extremely low 
TIMP-1 protein secretion by these cells is difficult to ex- 
plain. Post-transcriptional regulatory events may be re- 
sponsible for this confounding result. 

Taken together, the present study suggests that the in- 
trathyroidal cytokine microenvironment is involved in the 
regulation of MMP-1 and its inhibitor HMP-1 in the thy- 
roid, and that both proteins may be secreted by dediffer- 
entiated thyroid carcinoma cells and involved in aggressive 
thyroid tumors in vivo. * 
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Expression of embryonic fibrouecthi isoform EHIA parallels alpha-smooth 
muscle actin in maturing and diseased kidney. 

Barnes VL , Miisa X Mitchell RJ , Barnes JL 

Department of Medicine, Division of Nephrology, University of Texas Health Science 
Center, San Antonio, Texas, USA. 

In this study we examined if an association exists between expression of an alternatively 
spliced "embryonic" fibronectin isoform EIIIA (Fn-EIIIA) and alpha-smooth muscle 
actin (alpha-SMA) in the maturing and adult rat kidney and in two unrelated models of 
glomerular disease, passive accelerated anti -glomerular basement membrane (GBM) 
nephritis and Habu venom (HV)-induced proliferative glomerulonephritis, using, 
immunohistochemistry and in situ hybridization. Fn-EIIIA and alpha-SMA proteins were 
abundantly expressed in rhesangium and in periglomerular and peritubular interstitium of 
20-day embryonic and 7-day (D-7) postnatal kidneys in regions of tubule and glomerular 
development- Staining was markedly reduced in these structures in maturing juvenile (D- 
14) kidney and was largely lost in adult kidney. Expression of Fn-EIIIA and alpha-SMA 
was reinitiated in the mesangium and the periglomerular and peritubular interstitium in 
both models and was also observed in glomerular crescents in anti-GBM nephritis. 
Increased expression of Fn-EIIIA mRNA by in situ hybridization corresponded to the 
localization of protein staining. Dual labeling experiments verified co-localization of Fn- 
EIIIA and alpha-SMA, showing a strong correlation of staining between location and 
staining intensity during kidney development, maturation, and disease. Expression of 
EIIIA mRNA corresponded to protein expression in developing and diseased kidneys and 
was lost in adult kidney. These studies show a recapitulation of the co-expression of Fn- 
EIIIA and alpha-SMA in anti-GBM disease and suggest a functional link for these two 
proteins. 
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BMI-1 gene amplification and ovcrexpression in hematological 
malignancies occur mainly in mantle cell lymphomas. 



Bca'S, Tort F, Pin vol M . Puig X , Hernandez L , Hernandez S . Fernandez PL. van 
Lohuizen 1YL Colomer D . Canmo E . 

The Hematopathology Section, Laboratory of Anatomic Pathology, Hospital Clinic, 
Institut ^Investigations Biomediques August Pi i Sunyer.(IDIBAPS), University of 
Barcelona, Spain. 

The BMI-1 gene is a putative oncogene belonging to the Polycomb group family that 
cooperates with c-myc in the generation of mouse lymphomas and seems to participate in 
cell cycle regulation and senescence by acting as a transcriptional repressor of the 
INfC4a/ARF locus. The BMI-1 gene has been located on chromosome I Op 13, a region 
involved in chromosomal translocations in infant leukemias, and amplified in occasional 
non-Hod gkin's lymphomas (NHLs) and solid rumors. To determine the possible 
alterations of this gene in human malignancies, we have examined 160 
lymphoproliferative disorders, 13 myeloid leukemias, and 89 carcinomas by Southern 
blot analysis and detected BMM gene amplification (3- to 7-fold) in 4 of 36 (1 1%) 
mantle cell lymphomas (MCLs) with no alterations in the INK4a/ARF locus. BMI-1 and 
pI6INKL4a mRNA and protein expression were also studied by real-time quantitative 
reverse transcription-PCR and Western blot, respectively, in a subset of NHLs. BMI-1 
expression was significantly higher in chronic lymphocytic leukemia and MCL than in 
follicular lymphoma and large B cell lymphoma. The four tumors with gene 
amplification showed significantly higher mRNA levels than other MCLs and NHLs with 
the BMI-l gene in germline configuration. Five additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between BMI-1 mRNA 
levels and protein expression was observed in all types of lymphomas. No relationship 
was detected between BMI-1 and pl61NK4a mRNA leveis. These findings suggest that 
BMI-l gene alterations in human neoplasms are uncommon, but they may contribute to 
the pathogenesis in a subset of malignant lymphomas, particularly of mantle cell type. . 
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BMI-1 Gene Amplification and Overexpression in Hematological Malignancies 
Occur Mainly in Mantle Cell Lymphomas 1 
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Abstract 

The BMJ-J gene is a putative oncogene belonging to the Polyenmb 
group family that cooperates with c-myc in the generation of mouse 
lymphomas and seems to participate in cell cycle regulation and senes- 
cence by acting as a transcriptional repressor of the INK4a/ARF locus.. 
The BMl-l gene has been located on chromosome 1 Op 13, a region involved 
in chromosomal translocations in infant leukemias, and amplified in. 
occasional non-Bodgkin's lymphomas (NHLs) and solid tumors. To de- 
termine the possible alterations of this gene in human malignancies, we 
have examined 160 lympboproliferative disorders, 13 myeloid leukemias, 
and 89 carcinomas by Southern blot analysis and detected BMJ-J gene 
amplification (3- to 7-fnld) in 4 of 36 (1J%) mantle cell lymphomas 
(MCLs) with no- alterations in the JNKda/ARF locus. BMI-1 and pJ6 1NK "*' 
mRNA and protein expression were also studied by real-time quantitative 
reverse transcription-PCR and Western blot, respectively, in a subset of 
NHLs. BMJ-I expression was significantly higher in chronic lymphocytic 
leukemia and MCL than in follicular lymphoma and large B cell lym- 
phoma. The four tumors with gene amplification showed significantly 
. higher mRNA levels than other MCLs and NHLs with the BMJ-J gene in 
germ line configuration. Fire additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between 
BMI-1 mRNA levels and protein expression was observed in all types of 
lymphomas. No relationship was detected between BMI-1 and p)6 ,NK4 " 
mRNA levels. These findings suggest thai BMJ-J gene alterations in 
human neoplasms are uncommon, but they may contribute to the patho- 
genesis in a subset of malignant lymphomas, particularly of mantle cell 

Introduction 

The BMJ-P gene is a putative oncogene of the Poly comb group 
originally identified by retroviral inserttonal mutagenesis in E/x-c- 
myc transgenic mice infected with the Moloney murine leukemia 
vims (1, 2). These animals had a rapid development of pre- B cell 
lymphomas showing frequent proviral insertions near the BMI-I gene. 
This integration resulted in BMI-1 overexpression suggesting a coop- 
erative effect between C-MYC and BMJ-J genes in the development of 
these rumors (3, 4). Recent studies have indicated that the BMI-I gene 
may also participate in cell cycle control and senescence through the 
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JNK4a/ARF locus by acting as an upstream negative regulator of 
pl6 ,NK4a and P 14/pl9 ARF gene expression (5). The human BMJ-J 
gene has been mapped to chromosome 1 Op 13 (6), a region involved in 
chromosomal translocations in infant leukemias (7) and rearrange- 
ments in malignant T cell lymphomas (8, 9). More recently, high-level 
DNA amplifications of this region have been found by comparative 
genomic hybridization in NHLs and solid tumors (10, I I). However, 
the possible implication of the BMI-1 gene in these alterations and its 
role in the pathogenesis of human tumors is not known. The aim of 
this study was to analyze- the possible BMI-I gene alterations, and 
expression in a large series of human neoplasms and to determine the 
relationship with JNKAaJARF locus aberrations. 

Materials and Methods 

Case Selection. A series of 2.62 human rumors, including 173 hematolog- 
ical malignancies and 89 carcinomas (Table 1), matched normal tissues from 
all carcinomas, 1 1 samples of normal peripheral mononuclear cells, and 5 
reactive lymph nodes and tonsils, were selected based on the availability of 
frozen samplesTor molecular analysis. 

DNA Extraction and Southern Blot Analysis. Genomic DNA was ob- 
tained using Proteinase K/RNase treatment. 15 fxg were digested with £coRJ 
and A/mdlll restriction enzymes (Life Technologies, Inc., Gaithersburg, MD), 
for Southern blot analysis and hybridized with a L5-kb Pst\ fragment of the 
partial BMJ-1 cDNA (6). 

RNA Extraction and Real-time Quantitative RT-PCR. Total RNA was 
obtained from 67 lymphoid neoplasms (10 CLLs„ 27 MCLs. 8 FLs, and 22 
LCLs) using guanidine/isoihiocyanate extraction and cesium/chloride gradient 
centrifugaiion. One u.g of total RNA was transcribed into cDNA using 
MJVJLV-reverse transcriptase (Life Technologies, Inc.) and random hexamers, 
following manufacturer's directions. Sequences of the BMJ-J and the pJ6 
detection probes and primers were designed using the Primer Express program 
(Applied Biosystems, Foster City) as follows; BMI-I sense, 5'-CTGGTTGC- 
CCATTGACAGC-3'; .BMJ-J antisense, 5 : CAGAAAATGAATGCGAG- 
CCA-3'; P J6 sense, 5'-CAACGCACCGAATAGTTACCG-3'; pI6 antisense, 
5'-AACTTCGTCCTCCAGAGTCGC-3\ The probes BMI-J t 5'-CAGCTC- 
GCTTCA A G ATGGCCGC-3 ', and pI6, 5'-CGGAGGCCGATCCAGGTGG- 
GTA-3\ were labeled with 6-carboxy-fluorescein as the reporter dye. The 
TaqMan-GAPDH Control Rc3genls (Applied Biosystems) were used io am- 
plify and detect the GAPDH gene, as recommended by the manufacturer. The 
quantitative assay amplified 1 /xml of cDNA in two to four replicates using the 
primers and probes described above and the standard master mix (Applied 
Biosystems). All reactions were performed in an AB1 PRISM 7700 Sequence 
Deiector System (Applied Biosystems). GAPDH. BMJ-1, and pi 6 ,WK **" ex- 
pression was related to a standard curve derived from serial dilutions of Raji 
cDNA. The RUs of BMJ-1 and plC™ 4 ' expression were defined as the 
mRNA levels of these genes normalized to the GADP1I expression level in 
each case. 

Protein Analysis. Whole-cell protein extracts were obtained from addi- 
tional frozen tissue available in 31 cases (7 CLLs, 12 MCLs, 8 FLs, and *t 
LCLs), loaded onto a J0% SDS-polyacrylamide gel, and eJecrroblotted to a 
nitrocellulose membrane (Amersham). Blocked membranes were incubated 
sequentially with the monoclonal antibody BMJ-F6 (12), antimouse conju- 
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Tabic I Hematological malignancies and solid rumor samples analyzed for BMJ-I 
gene alter alions 



Tissue samples 


No> of CDSCS 


Hematological malignancies 




Hodgkin's disease 


2 


B cell lymphoprolifcrative disorders 




B- Acute lymphoblastic leukemia 


14 


CLL 


29 


Hairy cell leukemia 


. 4 


FL 


15 


MCL 


36 


LCL 


40 


T cell rymphoprolifcralive disorders 




T-Acutc lymphoblastic leukemia 


8 


Large granular cell leukemia 


4 


Peripheral T-cell lymphoma 


8 


Myeloproliferative disorders 




Acute myeloid leukemia 


7 


Chronic myeloid leukemia 




Solid tumors 




Colon carcinoma 


26 


Breast carcinoma 


29 


Laryngeal squamous cell carcinoma 


34' 


Total 


262 



gated to horseradish peroxidase (Amersbam), and detected by enhanced chemi- 
luminescence (Amersham) according to the manufacturer's recommendations. 

Statistical Analysis. Because of the non-normal distribution of the samples 
and the small. size of some subsets of rumors, the statistical evaluation was 
performed using nonparametric tests (SPSS, version 9.0). Comparison between 
mRNA expression levels in the different groups of NHLs was performed using 
the Kruskal-Wallis Test, with a P for significance set at 0.05. For differences 
between particular groups, the conservative Bonferroni procedure was per- 
formed, and the P was set at 0.005. The remaining statistical analyses were 
carried out using the Mann- Whitney nonparametric U test (significance, P 
<0.05). The comparison between BMM and p!6 ,NK4 ' quantitative mRNA 
levels was also performed using the Pearson's correlation coefficient. 

Results 

BMJ-J Gene Amplification. The BMJ-J gene was examined by 
Southern blot in a large series of human tumors and normal samples 
(Table 1). The cDNA probe used in the study detected three EcoKl 
fragments of 7.3, 3.8, and 2.6 kb and three Hindm fragments of 6.2, 
A, and 3.5 kb. BMI-J gene amplification (3- to 7- fold) was detected in 
4 of 36 (11%) MCLs (Fig. 1). The amplifications were con finned with 
both restriction enzymes. The amplified MCLs were two blastoid and 
two typical variants. No amplifications were observed in any of the 
solid tumors when compared with their respective matched non- 
neoplastic mucosa. No BMJ-J gene rearrangements were observed in 
any of the samples examined. 

BM1-1 mRNA Expression. To determine the BMM expression 
pattern in NHL we analyzed BMM mRNA levels by real-time quan- 
titative RT-PCR in 67 lymphomas ( 1 0 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs), including the four tumors with gene amplification. A distinct 
BMM mRNA expression pattern was observed in the different types 
of lymphomas (Fig. 2; Kruskal-Wallis Test; P < 0.001). The BM1 
mRNA levels in CLLs (mean, 2.2 RU; SD, 1.3) and MCLs with no 
BMJ-J gene amplification (mean, 2.5 RU; SD, 2.3) were significantly 
higher than in FLs (mean, 0.9 RU; SD, 0.8) and LCLs (mean, 0.6 RU; 
SD, OA; Mann- Whitney nonparametric U test; P < 0.0 J). The -1 
MCLs with BMJ-J gene amplification showed significantly higher 
levels of expression than all other groups of tumors (mean, 5. 1 RU; 
SD, 1.6; P < 0.005). In addition, five typical MCLs with no structural 
alterations of the gene also showed very high levels of BMM mRNA 
expression ranging from 4 to 9.8 RU, similar lo cases with gene 
amplification (Fig. Z4). 

BMM Protein Expression. BMM protein expression was exam- 
ined by Western blot in 31 tumors (7 CLLs; 12 MCLs, including two 
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cases with BMJ-J gene amplification and 4 cases with mRNA over- 
expression andno structural alteration of the gene; 8 FLs, and 4 LCLs) 
in which additional frozen tissue was available. The monoclonal 
antibody against BMM delected three closely migrating proteins of 
M r 45.000-48,000 (2). The two more slowly migrating bands prob- 
ably represent phosphorylated isoforms of the protein (1'2). The two 
MCLs wjth gene amplification and three of four cases with mRNA 
overexpression without amplification of the gene showed very high 
levels of protein expression. The remaining MCLs and CLLs showed . 
intermediate levels of protein expression, whereas low- or no-expres- 
sion signals were detected in the LCLs and FLs included in the study 
(Fig. 3). These results indicate that BMI-J protein expression in NHL 
is concordant with the mRNA levels observed by real-time quantita- 
tive RT-PCR. 

Relationship between BMI-J and pl6 JNK4a Gene Alterations. 

The JNK4a/ARF locus has been recently identified as a downstream 
target of the transcriptional repressing activity of the BMJ-J gene, 
suggesting that this gene may contribute to . human neoplasias with 
wild type JNK4/ARF (5). Most of the lymphoprolifcrative disorders 
analyzed in the present study, including the four cases with BMJ-J 
gene amplification, had been previously examined for p53 gene mu- 
tations and JNK4a/ARF locus alterations, including gene deletions, 
mutations, hypermethylation, and expression (13, 14). The four MCLs 
with BMJ-J gene amplification and mRNA overexpression and the. 
five tumors, with BMM mRNA overexpression with no structural 
alterations of the gene showed a wild-type configuration of the 
JNK4a/ARF locus (13). However, one case w\lh BMJ-J gene ampli- 
fication and one case with mRNA overexpression with no alteration of 
the gene showed p53 gene mutations associated with allelic deletions. 

To determine the possible relationship between BMM and 
pl6 1NK4a mRNA expression, p]6 JNK " a mRNA levels were evaluated 
by real-time quantitative RT-PCR in 50 tumors (10 CLLs, 27 MCLs, 
and 13 LCLs), including 6 cases with alterations in the JNK4a/ARF 
locus (2 MCLs and 1 LCL with pJ6 iWK/i * gene deletion, 2 LCLs with 
pi 6 promoter hypermethylation. and 1 CLL with pJ6™ KAa gene 
mutation), and the 4 lymphomas wilh BMJ-J amplification. Negative 
or negligible levels of p)6 ,NK4:> were observed in the 6 tumors with 
lNK4a/ARF locus alterations. These cases were not included in the 
comparisons between BMM and pI6 INK4 " mRNA expression. The 
pI6 rNK4a e:( p jess j on JeveIs were relatively similar in the different 
types of tumors. Only LCLs tended to have lower levels of expression, 
but the differences did not reach statistical significance (Fig. 2B). No 
differences were observed in the pie 1 ™*" 3 mRNA levels between 
tumors with BMJ-J gene amplification and overexpression and lym- 
phomas with germline configuration of the gene. 
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Fig. I. Southern blot analysis of BMl-J genc. Foui MCLs (MCL*) showed BMJ-J gene 
amplification (3- io 7- fold) compaied with non-ncoptasiic tissues (N) and oibcr NHLs" No 
amplifications or gene rearrangements were detected in (he remaining NHLs and carci- 
nomas included in the sftidv. 
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CLL 



MCL 



LCL 



Fig. 2. A, quantitative BMM mRNA transcript analysis (median and range) uiing 
real-time RT-PCR in a series of NHLs. MCLs with BMI-J gene ai'nplificntion {MCL*) 
revealed significantly higher overall BM1-I mRNA levels than all other types of NHLs. 
including MCLs with no structural alterations of the gene {P < 0.005). MCLs and CLLs 
expressed significantly higher levels than FLs and LCLs [P < 0.001). Results are depicted 
as ihc ratio of absolute BMM:GADPH mRNA transcript numbers (RU). Bars, SD. 
quanritative pl6 ,NK '" mRNA transcript analysis (median and range) using real-time 
RT-PCR in a series of NHLs. Expression levels were relatively similar in the different 
types of tumors. Results arc depicted as the ratio of absolute pl6" gK "":GADPH mRNA 
transcript numbers (RU). Bars, SD. 



Discussion 

Jn ihe present study, we have examined a large series of human 
tumors for ihe presence of gene alterations and mRNA expression of 
the BMI-J gene. Gene amplification was identified in four MCLs. 
These tumors showed significantly higher levels of mRNA and pro- 
tein expression compared with other lymphomas with BMJ-J in germ- 
line configuration. BMM expression levels were also highly up- 
regulated in a subset of MCLs with no apparent structural alterations 
of the gene. No alterations were detected in any of the different types 
of carcinomas included in the study. BMI-J is considered an oncogene 
belonging to the Polycomb group family of genes. These proteins 
mainly act as transcriptional regulators, controlling specific target 
genes involved in development, cell differentiation, proliferation, and 
senescence. Different studies have shown the implication of BMM 
overexpression in the development of lymphomas in murine and 
feline animal models (3 ? 4). The findings of the present study indicate 
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for the first time that BMI-J gene alterations in human neoplasms are 
an uncommon phenomenon, but they seem to occur mainly in a subset 
of NHLs, particularly of mantle cell type. 

The human BMI-1 gene has been mapped to chromosome JOpB. 
High-level DNA amplifications and gains in this region have been 
identified by. comparative genomic hybridization in occasional solid 
rumors and NHLs (1G\ II). Different chromosomal translocations 
involving the 10pl3 region have also been identified in infant leuke- 
mias and T cell lymphoproliferative disorders (7, 8, 15). Most acute 
Ieukemias with this chromosomal alteration occur in children <J2 
months of age, whereas it seems to be extremely rare in adults. I Op 
translocations in T-cell lymphoproliferative disorders have been ob- 
served mainly in adult T cell Jeukemia/lymphomas and occasional 
cutaneous T cell lymphomas. In our study, we did not observe BMJ-J 
rearrangements or amplifications in any of the acute Ieukemias or T 
cell lymphomas. However, all of the acute Ieukemias in this study 
were diagnosed in patients over I 6 years, and no adult T cell leuke- 
mia/lymphomas or cutaneous lymphomas could be included in the 
series. Similarly, high-level DNA amplifications at the J0pl3 region 
have been detected in head and neck carcinomas and other solid 
tumors. Although we found no evidence for BMI-J gene rearrange- 
ments or amplifications in a substantial set of carcinomas, this does 
not exclude the possibility of increased gene expression or protein 
levels in these tumors. Additional studies are required to elucidate the 
possible involvement of BMI-J in these particular groups of human 
neoplasms. . . 

Jn human hematopoietic cells, BMI-1 is preferentially expressed in 
primitive CD34 + bone marrow cells, whereas it is negative or very 
low in more mature CD34 - cells ( 1 6). )n peripheral lymphocytes, and 
particularly in follicular, B cells, BMM protein expression has been 
detected in resting cells of the mantle zone, whereas it is down- 
regulated in proliferating germinal center cells (17, 18). These obser- 
vations indicate that BMI-1 expression in normal hematopoietic cells 
is tightly regulated in relation with cell differentiation in bone marrow 
and antigen-specific response in peripheral lymphocytes. BMM ex- 
pression in human tumors has not been examined previously. In this 
study, we have demonstrated that BMI-1 mRNA and protein expres- 
sion show a distinct pattern in different types of lymphomas. Thus, 
BMM levels were low in LCLs and FLs and significantly higher in 
MCLs and CLLs. These findings suggest that BMM expression 
patterns in B cell lymphomas maintain in part the expression profile 
of their normal cell counterparts; because FLs and at least a subgroup 
of LCLs are considered lymphomas derived from follicular germinal 
center cells, whereas MCLs and CLLs are tumors mainly derived from 
naive pregerminal center cells. However, the four MCLs with BMJ-I 
gene amplification expressed significantly higher mRNA levels than 
all other tumors. In addition, five MCLs with no structural alterations 1 
of the gene showed high mRNA levels similar to those observed in 
tumors with BMJ-I gene amplification, suggesting that other mecha- 
nisms may be involved in up-regulation of the gene in these lympho- 
mas. Different studies using animal models have shown a dose- 
dependent effect of BMI-J gene expression on skeleton development 
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Fig. 3. Western bfol analysis of BM1- j protein in NHLs. The amplified MCL (17624) 
showed i he highest it Ml- 1 protein levels, whereas other MCLs and CLLs had inter mediate 
levels of opicbsion. Very lenv or negative signal wa 5 observed in FLs and LCLs. 
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and lymphomagenesis (J . 3). These observations suggest that the high 
mRNA and protein levels detected in a subset of MCLs may play a 
roJe in the pathogenesis of these neoplasms. 

Recent studies have identified the JNK4/ARF locus as a down- 
stream target of the BM1-1 transcriptional repressor activity, suggest- 
ing that BMM overexpression may contribute to human neoplasias 
that retain the wild-type JNK4a/ARF Jocus (5). Interestingly, in our 
study, BMJ-J amplification and overexpression appeared in rumors 
with no alterations in pJ6 WK4 * and pJ4 ARr genes. However, we could 
not detect differences in the expression leveJs of pl6 1NK4i " in tumors 
with arid without BMJ-J gene alterations. The reasons for this apparent 
discrepancy with experimental observations are not clear. One possi- 
bility may be that genes other than INK4a/ARF are the main targets of 
BMM repressor activity in these rumors. Particularly, different genes 
of the HOX family are regulated by BMM and may also be involved 
in lymphomagenesis (19, 20). 

In conclusion, the findings of this study indicate that BMJ-J gene 
. expression is differentially regulated in B cell lymphomas. Alterations 
of the gene seem to be an uncommon phenomenon in human neo- 
plasms, but they may contribute to the pathogenesis in. a subset of 
MCLs, Although, BMJ-J gene alterations occurred in tumors with 
wild-type JNK4a/ARF locus, the possible cooperation between these 
genes and the oncogenic mechanisms of BMM in human neoplasms 
require additional analysis. 
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Histopathology is insufficient to predict disease progression and clinical outcome in lung adeno- 
carcinoma. Here we show that gene-expression profiles based on microarray analysis can be 
used to predict patient survival in early-stage lung adenocarcinomas. Genes most related to sur- 
vival were identified with univariate Cox analysis. Using either two equivalent but independent 
training and testing sets, or 'leave-one-out' cross-validation analysis with all tumors, a risk index 
based on the top 50 genes identified low-risk and high-risk stage I lung adenocarcinomas, which 
differed significantly with respect to survival. This risk index was then validated using an inde- 
pendent sample of lung adenocarcinomas that predicted high- and low-risk groups. This index 
included genes not previously associated with survival. The identification of a set of genes that 
predict survival in early-stage lung adenocarcinoma allows delineation of a high-risk group that 
may benefit from adjuvant therapy. 



Lung cancer remains the leading cause of cancer. death in indus- 
trialized countries. Most patients with non-small cell Jung can- 
cer (NSCLC) present with advanced disease, and despite recent 
advances in multi-modality therapy, the overall 10-year survival 
rate remains a dismal 8-10%V However, a significant minority of 
patients (-25-30%) with NSCLC have stage 1 disease and ieceive 
surgical intervention alone. Although 35-50% of patients with 
stage I disease will relapse within 5 years 2 "*, it is not currently 
possible to identify specific high-risk patients. 

Adenocarcinoma is currently the predominant histological 
subtype of NSCLC (refs. 1,5,6). Although morphological assess- 
ment of lung carcinomas can roughly stratify patients, there is a 
need to identify patients at high risk for recurrent or metastatic 
disease. Preoperative variables that affect survival of patients 
with NSCLC have been identified 7 " ,D . Tumor si2e, vascular inva- 
sion, poor differentiation, high tumor-proliferative index and 
several genetic alterations, including K-ras (refs. 13,12) and p53 
(refs. 10,13) mutations, have prognostic significance. Multiple 
independently assessed genes oi gene products have also been 
investigated to better predict patient prognosis in lung can- 
cer'*'*. Technologies that simultaneously analyze the expression 
of thousands of genes" can be used to correlate gene-expression 
patterns with numerous clinical parameters — including patient 
outcome—to better predict tumor behavior in individual pa- 
tients™. Analyses of Jung cancers using array technologies have 
identified subgroups of tumors that differ according to tumor 
type and histological .subclasses and, to a lesser extent, survival 
among adenocarcinoma paticnts :i?7 . Mere we correlated gene- 
expression profiles wiih clinical outcome in a cohort of patients 
with lung adenocarcinoma and identified specific genes that 



predict survival among patients with stage 1 disease. For further 
validation, we also show that the risk index predicted survival in 
an independent cohort of stage I lung adenocarcinomas. 

Hierarchical profile clustering yields three tumor subsets 
Using oligonucleotide arrays, we generated gene-expression pro- 
files for 86 primary lung adenocarcinomas, including 67 stage 1 
and 19 stage 111 tumors, as well as 10 non-neoplastic lung sam- 
ples. Selected sample replicates showed high correlation among 
coefficients and reliable reproducibility. We determined tran- 
script abundance using 3 custom algorithm and the data set was 
trimmed of genes expressed at extremely low levels, that is, 
genes were excluded if the measure of their 75th percentile value 
was less than 100. Although potentially resulting in the loss of 
some information, trimming in this manner decreased the possi- 
bility that the clustering algorithm would be strongly influenced 
by genes with little or no expression in these samples.' 
Hierarchical clustering with the resulting 4,966 genes yielded 3 
clusters of tumOTS (Fig. 1). All 10 non-nebplastic samples clus- 
tered tightly together within Cluster 1 (data not shown). We ex- 
amined the relationships between cluster and patient and tumor 
characteristics (Fig. 1 and Supplementary Figure A online). There 
were associations between cluster and stage (P = 0.030). and be- 
tween cluster and differentiation {P - 0.03). Cluster 1 contained 
the greatest percentage (42.8%) of well differentiated tumors, 
followed by Cluster 2 (27%) and Cluster 3 (4.7%). Cluster 3 con- 
tained the highest percentage of both poorly differentiated 
(47.6%) and stage II! tumors (42.8%), yet contained 3 (14.3%) 
moderately differentiated and 1 (5%) well differentiated stage 1 
tumor. Notably, 11 stage 1 tumors were present in Cluster 3, sug- 
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gesting a common gene-expression profile for 
this subset of stage I and stage HI rumors. 

For patients with stage I and stage III tumors, 
the average ages were 68 J and 64.5 years and 
the percentage of smokers was 88.9% and 
89.5%, respectively. Marginally significant as- 
sociations between cluster and smoking his- 
tory were observed {P = 0.06). A significant 
relationship between histopathological classifi- 
cation and cluster was only discemable for 
bionchioloalveolar adenocarcinomas (BAs), 
which were only present in Clusters 1 and 2 
(F= 0.0055) and comprised 35.7% and 12.3% 
of tumors for Clusters 3 and 2, respectively. 

We examined the heterogeneity in gene-ex- 
pression profiles based on the trimmed data set among normal 
lung samples and stage 1 and stage 111 adenocarcinomas by calcu- 
lating correlation coefficients between all pairs of samples. In 
contrast to normal lung samples that displayed highly similar 
gene-expression profiles (median correlation, 0.9), both stage 1 
and Ul lung tumors demonstrated much greater heterogeneity in 
their expression profiles with lower correlation coefficients (me- 
dian values, 0.82 and 0.79, respectively). 

Northern-blot and immunohistocberrmtry analyses 
Of the 4,966 genes examined, 967 differed significantly between 
stage I and 111 adenocarcinomas, a number in excess of that ex- 
pected by chance alone (248 at alpha level (a) = 0.05). Three 
genes were arbitrarily selected to verify the microarray expiession 
data. The mRNA from 20 of the normal lung and tumor samples 
was examined by northern-blot hybridization with probes for in- 
sulin-like growth factor-binding protein 3 {JGFBP3), cystatin C 
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Fig. 1 Unsupervised classification analysts of lung adenocarcinomas. 3 classes of tumors identi- 
fied by agglomerauve hierarchical clustering of gene-expression profiles using the 4;966 expressed 
genes. Patient and histopathological information for each lung adenocarcinoma case by cluster 
designation and methods for K-ros 12/13th-codon mutational status and nuclear p53 protein ac- 
cumulation are provided (Supplementary Figure A online). TN classification denotes information 
regarding patient tumor size and nodal involvement Associations between cluster membership 
and patient or histopathological variables are indicated at significance level (P< 0.05). 



arid lactate dehydrogenase A {LDH A) (Fig. 2<a). Two gene probes 
not represented on the microarrays were used as controls, includ- 
ing histone H4, a potential index of overall cell proliferation, and 
28S liposomal RNA, a control for sample loading and transfer. 
The relative amounts of 1GFBP3, cystatin C and LDH-A mRNA 
strongly correlated with microarray-based measurements (Fig. 
2b). In both assays, 1GFBPZ and LDH-A mRNA levels increased 
from stage I to stage III adenocarcinomas and were higher than 
those in normal lung. Cystatin C mRNA levels were more variable 
but relatively greater in normal Jung than tumors. These results 
suggest that the oligonucleotide microarrays provided reliable 
measures of gene expression. The rumors showed slightly greater 
histone H4 expression than the normal lung, likely reflecting in- 
creased proliferation of tumor cells. 

Immunohistochemistry was performed lor 1GFBP3, cystatin C 
and HSP-70 to determine whether mRNA oveiexpression was re : 
fleeted by an increase of their corresponding proteins in tumors. 
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fig. 2 Validation analyses of gene-eapres- 
sion piofiling. a. Northern- blot analysis of 
selected candidate genes tor verification of 
dots obtained hom oligonucleotide arrays. 
7he same sample RNA loi the 4 uninvofved 
lung, 8 stage I and 8 stage Ml tumors was 

used for the northern-blot and oligonucleotide array analyses. 
6, Correlation analysis of quantitative data obtained from oligonucleotide 
anays and nonhern blols measured by integrated phosphorimagcr-based 
signals loi the IGIBP3 and IDH-A genes. "The ratio of fOfBP3, cystatin C 
and IDH A mRNA to 28b rRNA was determined. The relative values loi 
each gene from each sample are shown, n, nonneoplastic normal tung; 
1, stage I tumors; 3, stage (If tumors. c r Immunohistochemical analysis ol 
IGfBP-3, 1 .5P- 70 and cyslalin C in lung and tuny adenocarcinomas. 
Cytoplasmic. IC! BP- 3 immunoreactivity in a neoplastic gland (tumor 12?) 
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with prominent apical staining (blue reaciant staining, arrow, upper left). 
Diffuse cytoplasmic HSP-70 immunoreactivity (tumor 127), yet stromal el- 
ements show no reactivity (upper right). Normal lung parenchyma (lower 
left) shows cytoplasmic cystatin C immunoreactivity in alveolar pneumo- 
cytes (arrow) and intra- alveolar macrophages but tumor (190) shows dif- 
fuse cytoplasmic cystatin C immunoreactivity with prominent apical 
staining (lower right). Magnification, x?00 
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Immunoreactivity for both JGFBP-3 and HSP-70 (Fig. 2c) was de- 
tected in the cytoplasm of the adenocarcinomas, with little de- 
tectable reactivity in the stromal or inflammatory cells. Cystatin 
C was detected in alveolar pneumocytes and intra-alveolar 
macrophages in non neoplastic lung parenchyma and also con- 
sistently in the cytoplasm of neoplastic cells. 

Gene-expression profiles predict survival 

As expected, Kaplan-Meier survival curves (Fig. 3a) and log-rank 
tests indicated poorer survival among stage 1)1 compated with 
stage I adenocarcinomas (P = <0.000l). Two statistical ap- 
pioaches were used to determine whether gene-expression pro- 
files could predict survival using the data set of 4,966 genes. In 
one approach, equal numbers of randomly assigned stage 1 anct 
Mage HI tumors constituted training {ri~ 43) and testing {n = 43) 
sets. In the training set, the top 10, 20, 50 or 75 genes were used 
to aeale risk indices that were evaluated lor their association 
with survival using the 50th, 60th or 70tb percentile cutoff 
points to categorize patients into high or low groups. The results 
were similar across cutoff points but the 50 gene risk index had 
the best overall association with survival in" the training set. 

818 



Fig. 3 Gene-expression profiles and patient survival, o, Relationship be- 
tween tumor stage and patient survival (stage 1 and stage 3 differ signifi- 
cantly, P< 0.0001). b, Relationship between the survival in the 43 test 
samples and their risk assignments based on the 50-gene risk index esti- 
mated in the 43 training samples. The high- and low-risk groups differ sig- 
nificantly (P = 0.024). €, Relationship between patient survival and the risk 
assignments in test samples (in b) conditional for tumor stage. The high- 
and low-risk stage I groups differ significantly (P= 0.028), whereas stage III 
low- and high-risk groups did not (P- 0.634). d, Relationship between sur- 
vival in the test cases and their risk assignments based on the 86 leave-one- 
out' cross-validation of the 50-gene risk index. The high- and low-risk 
groups differ significantly (/>= 0.0006). e, Relationship between test case's 
risk assignment and survival (in d) conditional on tumor stage. The high- 
and low-risk stage I lung adenocarcinoma groups differ significantly from 
each other (P = 0.003), whereas low- and high-risk stage 111 tumors do not. 
f, Relationship between tumor class identified by hierarchical clustering and 
patient survival. Survival for patients in Cluster 3 differed relative to the tu- 
mors in Cluster 2 (P = 0.037) and approached significance for Cluster 1 and 
2 combined (P = 0.06). g, Analysis of the Michigan-based risk index using 
top cross-validated survival genes identify a low- and high-risk group in an 
independent cohort of B4 Massachusetts- based lung adenocarcinomas that 
are significantly different (P = 0.003). h, Among the 62 stage I lung adeno- 
carcinomas in the Massachusetts sample, the high- and low-risk groups dif- 
fered significantly (P = 0.006). 



After conservatively choosing the 60th percentile cutoff point 
from the training set, we then applied this risk index and cutoff 
point to the testing set. The risk index of the top 50 genes cor- 
rectly identified low- and high-risk individuals within the inde- 
pendent testing set (P = 0.024) (Fig. 3b and Supplementary 
Methods online). Notably, 1] stage 1 tumors were included in 
the high-risk subgroup. When this irsk assignment was then 
conditionally examined for stage progression (Fig. 3c), low- and 
high-risk groups among stage 1 tumors were found to differ (P = 
0.028) in their survival. 

Identification of a robust set of survival genes 
Although predictive of patient survival, a single training-testing 
set may not provide the most robust set of genes due to random 
sampling issues. Therefore, a 'leave-one-out' cross-validation ap- 
proach was used to identify genes associated with survival from 
all 86-tumor samples. We first developed a 50-gene risk index in 
each training set, and then applied the risk index to the test case 
held out from the full set of tumors and assigned the held out 
tumor to the high- or low-risk groups (Fig. 3d). The high and 
low-risk subgroups determined in the test cases differed signifi- 
cantly in their overall survival (P = O.00O6). Among the larger 
group of stage 1 lung adenocarcinomas, the low-risk {n = 46) and 
high-risk {n = 21) groups had markedly different survival (P = 
0.003) (Fig. 3e). Table 1 lists selected examples of the cumulative 
top 100 genes derived from this cross-validation procedure 
(complete list in Supplementary Table A online). 

It was also noted thai many of the stage 1 patients in the high- 
risk subgroup (Fig. 3*0 were present in Cluster 3 (Fig. 1). 
Kaplan-Meier analysis (Fig. 3/) demonstrated a significantly 
worse survival [P = 0.037) lor patients in Cluster 3 relative to pa- 
tients in Cluster 2 and approaching significance for Cluster 1 
and 2 combined </>= 0.06). This further indicates the important 
relationship between gene-expression profiles and patient sur- 
vival, independent of disease stage. 

Consistent with previous analyses of Jung adenocarcinomas' 3 , 
40% of stage I and 57.8% of stage III tumors had 12th or 13th 
codon K ras gene mutations. Those patients with tumors con- 
taining K-ras mutations showed a trend of poorer survival, but 
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Table 1 Selected examples of the top 100 genes from cross-validation 



Cene name P % P 

(normal versus . Change in tumor (stage I versus 
tumor Mest) stage 1)1 Mest) 



% Change in Coefficient 
stage III f) 



Unigene comment 



CL 
D 
O 



CASP4 
P63 



KR77 
LAMB1 

BMP 2 
CDC6 

51 OOP 
SERPINE1 

STX1A 

ADM 

AKAP 12 

ARHE 

CRB7 

VECF 

WNT10B 



~ HSPA8 

3 
£L 

Q» ERBB2 
B 

JJ FXYD3 
£ SLC20A1 



CSTB 
CTSL 
CYP24 

FUT3 

IV1LN64 
PDE7A 
PtCt 
51C1A6 



COPEB 
CRK 



RE LA 



K1AA000S 
MCB1 



0-56 
9.73E-04 



8.02E-08 
0.14 

0.54 
1.3U-05 

2.10E-08 
2.89E-03 

8.65E-08 

0.05 
8:53E-03 

0.06 
2.02E-03 
6.50E-08 

0.05 



0.36 

0.04 
0.10 
1.34E-03 



1.57E-04 

0.48 
3.16E-06 

1.O7E-07 

0.20 
0.12 
0.04 
0.07 



0.10 
0.10 



2.2H-04 
0.27 



-6% 
37% 



126% 
-20% 

-21% 
1070% 

1572% 
72% 

54% 

39% 
-47% 
-39% 
38% 
174% 
31%' 



92% 
111% 
58% 



-10% 
N/A 

114% 

32% 
33% 
-68% 
-32% 



-33% 
32% 



-7% 



40% 
125% 



0.02 
0.03 



0.11 
0.01 

0.27 
0.05 

0.19 
0.25 

0.07 

0.04 
0.05 
0.05 
0.63 
0.02 
0.48 



9.01 E-04 
0.37 
0.31 
0.02 



0.15 
0.03 
0.97 

0.97 

0.42 
0.01 
0.35 . 
0.12 



0.26 
0.03 



0.02 
0.33 



57% 
43% 



55% 
60% 

47% 
148% 

77% 
30% 

26% 

117% 
214% 
87% 
15% 

85% ' 
20% 



51% 
120% 
73% 
66% 



34% 
67% 
2% . 

-1% 

80% 
-35% 
-1 70% 

86% 



25% 
48% 



20% 



45% 
459% 



0.0022 
0.0010 



0.0003 
0.0027 

0.0044 
0.0124 

0.0001 
0.0008 

0.0031 

0.0016 
0.0010 
0.0092 
0.0030 
0.0013 
0.0022 



0.0008 
0.0013 
0.0046 
0.0021 



0.0001 
0.0007 
0.0008 

0.0033 

0.0007 
-0.0187 
-0.001 1 
0.0069 



0.0016 
0.0098 



0.0010 
0.0018 



Apoptosis-r elated 
Caspase 4, apoptosis- 

related cysteine protease 
Transmembrane protein (63 kD), 

endoplasmic reticulum/ 

Colgi intermediate compartment 

Cell adhesion and structure 
Keratin 7 
Laminin, pi 

Cell cycle and growth regulators 
Bone morphogenetic protein 2 
CDC6 (cell division cycle 6, 

Socchqwmyces cerevhioe homolog) 
S1 00 calcium- binding protein P 
Serine (or cysteine) proteinase inhibitor, 

clade E (nexin) 
Syntaxin 1A (brain) 

Cell signaling 
adrenomedullin 

A kinase (PRKA) anchor protein (gravin) 1 2 
ras homolog gene family, member E 
Growth factor receptor-bound protein 7 
Vascular endothelial growth factor 
Wingless- type MMTV integration site family, 
member 1 0B 

C ha per ones 

Heat-shock 70 kD protein 8 
Receptors 

v-erb-b2 avian erythroblastic leukemia viral 

oncogene homolog 2 

FXVD domain-containing ion transport 

regulator 3 
Solute carrier family 20 (phosphate 

transporter), member 1 

Enzymes, cellular metabolism 
Cystatin B (stefin B) 
Cathepsin L 

Cytochrome P450, subfamily XXIV 

(vitamin D 24-hydroxylase) 
Fucosyltransferase 3 (galacloside 3(4)-L- 

fucosyltransf erase, Lewis blood group included) 
Steroidogenic acute regulatory protein related 
Phosphodiesterase 7A 
Plasminogen-ltke 

Solute carrier family 1 (high- affinity aspartate/ 
glutamate transporter), member 6 

Transcription and translation 

Core promoter element binding protein 

v-crk avian sarcoma virus 0 10 oncogene 

homolog 
v-rel avian reticuloendotheJiosis viral 

oncogene homolog A 
Unknown function 
KIAA0005 gene product 
Mammaglobin 1 



Bolded genei were also jigniliranl tot su/vivs) in 43 tumor training set (fig. 3b). 



Table 1 Selected e>amples ot the cumulative top 100 genes identified using 
training- testing, cioss- validation of all 86 lung tumor samples. The percent 
change, as well as the ditet tion, lor the average values of the 1 0 non- neoplastic 
lung to all tumors, and lor the 67 stage I to the 1 9 stage III tumors are shown. A 
positive coefficient fi value is indicative of a relationship of gene expression to a 



poorer pauent outcome. The genes a»e listed in potential functional categories. 
Genes that were also present in the top 50 survival genes using the 43- tumor 
training set (Fig. 3b) are indicated in bold type. Complete listing of the gene 
probe sets and annotated gene and unigene identifiers can be found in the 
Supplementary Methods. 
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Fig. 4 Gene expression pat t ems of top survival genes o, Gene- expr ession patterns de- 
termined using agglomerative hierarchical clustering of the 86 lung adenocarcinomas 
against the 100 survival- related genes (Table 1) identified by the training- testing, cross- 
validation analysis. Substantially elevated (red) or decreased (green) e>piession of the 
genes is observed in individual tumors. Some tumors (black arrow and expanded area) 
show extremely elevated expression of specific genes, b. An outlier gene-expression pat- 
tern {>b times the interquartile range among all samples) is observed tor the f/fcB2 and 
RegIA genes (top left and right, respectively). The $ 7 OOP and crk genes (bottom left and 
right, respectively) show a graded pattern of expression related to patient survival, O, 
alive; •, dead {also in c). t, "The number of outlieis per person idrmitird in the lop 100 
genes plotted by survival distribution. 
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this difference did not reach statistical significance among all 
patients (F = 0.25), between patients within tumor clusters (? = 
0.41) or when analyzed separately among stage I {P = 0.22) and 
stage III (F= 0.53) patients. Nuclear accumulation of p53 was de- 
tected in 17.9% stage 1 and in 22.2% stage 111 tumors. No signifi- 
. cant relationship was observed for p53 staining and patient 
survival, cluster or tumor stage. 

Confirmation using an independent set of adenocarcinomas 
The robustness of our 50-gene risk index in predicting survival in 
lung adenocarcinomas was tested using oligonucleotide gene-ex- 
pression data obtained from a completely independent 
(Massachusetts-based) sample of 84 lung adenocarcinomas (62 
stage 1, 14 stage 11 and 8 stage 111; ref. 21, and dataset A at 
www.genome.wi.mit.edu/MPR/lung). To ensure equivalent 
power, for testing and comparability of samples, the criteria for 
including tumors in the analysis were 40% or greater tumor cellu- 
larity, no mixed histology (that is, adenosquamous) and patient 
survival information. To obtain comparative gene- expression 
measures between the two data sets, gene sequences present on 
the U95A and HuGeneFL array were examined, and expression 
data for our top 50 cross-validation genes for all 84 Massachusetts 
samples were obtained and processed 24 (see also Supplementary 
Methods online ). When we examined the risk assignment of 
these 84 samples, employing the identical cutoff point used for 
the 86 Michigan-based lung samples, we observed low- and high- 
risk groups (Fig. 3*; P = 0.003). Notably', among the 62 stage I tu- 
mors, high- and low-risk groups were observed that differed 
significantly (P = 0.006) in their survival (Fig. 3h). 

Survival genes had graded and outlier expression patterns 

A statistical and graphical analysis of the ] 00 survival-related 



genes (Table 1) clustered against all 86 tumors revealed individ- 
ual rumors with substantially elevated expression in both a lim- 
ited and larger number of genes (Fig. 4a). Among these genes, we 
observed two distinct patterns of expression related to patient 
survival. One pattern, designated 'outlier', included genes show- 
ing substantially elevated expression (greater than five times the 
interquartile range among all samples), whereas the other pat- 
tern, designated 'graded', was characterized by continuously dis- 
tributed expression with patient survival (Fig. 4b). The erbB2 and 
Reg} A genes are examples of outlier expression patterns and 
SWOP and crk genes of graded patterns. The number of outliers 
per person in the top 100 genes was identified and plotted ac- 
cording to survival times and events (Fig. 4c). Both stage I and 
stage HI lung adenocarcinomas showed outlier gene patterns 
and 10 tumors contained 3 or more outlier genes. 

Because gene amplification may result in increased gene ex- 
pression, the nine genes with outlier expression patterns (erbB2, 
5LCJA6, Wnt 1, MGB1, Reg} A, AKAPM, PACE, CYP24, KYNU) 
and one gene with a graded expression pattern (0X18) were ex- 
amined using quantitative genomic PCR to evaluate genomic 
copy number (Fig. 5a). Gene amplification of erbB2 (17ql2) was 
detected in tumor L94, which had the highest erbB2 mRNA ex- 
pression (Fig. 4a). Gene amplification was not detected for any 
of the other seven tested genes in tumor L94, as well as in other 
tumors. The two genes most frequently demonstrating the out- 
lier pattern in these Jung adenocarcinomas were KYNU and 
CYP24, and were present in 10 and 9 tumors, respectively. 
CYP24 has been described as a gene amplified and overexpressed 
in breast cancer 25 , and these results indicate elevated expression 
in lung adenocarcinoma. 

To determine whether the graded or outlier gene-expression 
patterns also occur at the protein-expression level, 10 of the 100 
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Fig. 5 Gene amplification and protein expression of surviv-at-related genes, 
o. Analysis ol potential gene amplication loi 9 genes showing out lief expres- 
sion patterns in the lung tumors (erbB2, MCI AS, Wnt 1, MCB1, Reg'\A, 
AKAP^2 r PACl, CYP24 and KYNU) and examined using quantitative genomic 
PCR. A gene showing graded expression pattern (Kftn8), and one gene 
(PACE4) with a similar chromosome location as PACf, were used as controls. 
Only rr£?B2 and fteoTA are shown. An esophageal adenocarcinoma with 
V.nown high- level genomic amplification ol erbB? was used as a positive con- 
lrol and normal esophagus DNA was used as a negative control (Of). PCR 
tiagments sizes we.e 343 bp for GAPDH, 166 bp for c/6B? and 126 bp for 



/teal A. DNA is horn normal lung (N) and tumorfT) from each patient (for ex- 
ample L37). h t Imrnunohis toe hemic ol analysis of survival related genes with 
lung adenocarcinoma mkroarrays using the tumors from this study. Fhe 
transmembrane erbB2 protein (top lelt) expression is substantially increased 
in tumor 194 containing the amplified rrf;B2 gene (Fig. 4a and b). Expression 
of VFXF (top right) and 51 OOP (bottom left) was located within the neoplas- 
tic cells and the pattern of imrnunoteactivity was consistent with the graded 
expression pattern demonstrated by their mRNA profiles. Expression of the 
oncogene crk (bottom right) was abundantly e>pressed in neoplastic lung 
cells. Magnification, x<!00 (erbB?); >?00 (VEGF, 51 OOP and crk). 
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top survival genes (Table 1) for which specific antibodies were 
available were chosen for immunohistochemicaj analysis using 
lung-tumor arrays from this study (Fig. 5£). Expression of mem- 
brane erbB2 protein was substantially increased in the er£B2-am- 
plified tumor L94 and very low levels of expression were present 
in other tumors, consistent with mRNA- express! on measure- 
ments (Fig. 4a and b). CDC6 protein expression was also sub- 
stantially higher in tumor L94, consistent with mRNA levels 
(data not shown). Expression of vascular endothelial growth fac- 
tor (VEGF) and S100P (Fig. Sb), as well as cytokeratin 18 (KRT18), 
cytokeratin 7 (KRT7) and fas-associated death domain (FADD) 
protein (data not shown), was located within the lung tumor 
cells and consistent with the graded expression pattern of the 
mRNA profiles. The oncogene ak showed both graded mRNA as 
well as a graded protein-expression pattern with survival, and 
was abundantly expressed in the tumor cells (Fig. Sb). These re- 
sults indicate that many survival-associated genes are expressed 
at the protein level and demonstrate similar mRNA and protein- 
expression patterns. 

Discussion 

We used several approaches for the analysis of gene- expression 
data related to clinicopathological variables and patient surr 
vival. One approach, hierarchical clustering, was used to exam- 
ine similarities among lung adenocarcinomas in their patterns 
of gene expression. Previous studies of lung tumors 21 - 22 have also 
used this method to describe subclasses. of lung tumors. Here, 
we found three clusters that showed significant differences with 
respect to tumor stage and tumor differentiation. This suggests, 
as expected, that tumors with similar histological features of 
differentiation demonstrate similarities in gene expression. 
This feature also partly underlies the observed statistical associ- 
ation of tumor stage and cluster, as many of the higher-stage tu- 
mors, often poorly differentiated and previously associated 
with a reduced survival'- 10 , were located in Cluster 3. Although 
this cluster, contained the highest percentage of stage III tu- 
mors, it also contained a nearly equal mixture of stage 1 and 
stage III tumors and not all tumors were poorly differentiated. 
This indicates that a subset of stage 1 lung adenocarcinomas 
share gene-expression profiles with higher-stage tumors. 
Notably, 10 of the 1 1 stage I tumors found in Cluster 3 were the 
high-risk stage I tumors identified using the risk index in the 
'leave-one-out' cross-validation. 

In contrast to previous analyses of lung adenocarcinomas 21,22 , 
we validated the expression data from the arrays. The strong cor- 
relation of nonhern-blot analysis and oligonucleotide-array data 
for gene expression in the same samples (Fig. 2b) indicates that 
these studies provide robust gene-expression estimates. 
Immunohistochemistry using the same tumor samples in tissue 
arrays demonstrates protein expression within the lung tumor 
cells. Together, these studies indicate that many of the genes 
identified using gene-expression profiles are likely relevant to 
lung adenocarcinoma. For example, JGFBP3 gene expression is 
increased in lung adenocarcinomas (Fig. 2c). 1GFBP3 protein 
modulates the autocrine or paracrine effects of insulin-like 
growth factors, elevated IGFBP3 expression is observed in colon 
cancer*, and increased serum IGFBP3 is associated with progres- 
sion in breast cancer 7 . Heat-shock protein 70 {HSP-70) is in- 
creased in lung adenocarcinomas of smokers 7 * and is associated 
with increased metastatic potential in breast cancer". Increased 
serum lactate dehydrogenase is correlated with tumor stage and 
tumor burden 30 , and cystatin C, a cysteine protease inhibitor ex- 



pressed in human lung cancers 31 , is prognostic in some cancers 52 . 
The decreased expression of this protease inhibitor may affect 
the invasive properties of the tumor cell. 

. The cross-validation analytical strategy we used is particularly 
informative for these types of gene- expression analyses for dis- 
ease outcome"- 34 , and identification of cross-validated genes with 
a larger tumor cohort may help refine this risk index for use in a 
clinical setting. The gene-expression data also provide opportune 
ties to observe overarching patterns that advance our under- 
standing of associations between genes and disease. For example, 
the top 100 survival genes include those involved in signaling, 
cell cycle and growth, transcription, translation and metabolism. 
Expression of many of these genes is likely a function of increased 
proliferation and metabolism in the more aggressive tumors. 
Some genes, such as erb%2 and ReglA (Fig. 4a and b), were highly 
overex pressed in a few patients having poor survival. In one 
tumor, the erbBZ gene was amplified (Fig. 5a), demonstrating thai 
genomic changes may underlie the over expression of a subset of 
these outlier genes. Immunohistochemistry confirmed protein 
overexpression in this patient's tumor (Fig. Sb). Notably, seven of 
the eight outlier genes were not amplified, indicating that other 
mechanisms underlie the increased mRNA expression of these 
survival-related genes. 

Most genes showed a graded relationship between expression 
and patient survival. Genes such as that encoding VEGF, known 
to be strongly associated with survival in lung cancer 3 *' 36 were 
identified as related to patient survival in our study. VEGF . 
demonstrated a graded expression pattern, as did the S100P and 
ak oncogene (Fig. Sb). SI OOP is a calcium-regulated pjotein not 
previously reported in lung cancer. The ak gene, the cellular ho- 
molog of the v-crk oncogene, is a member of a family of adaptor 
proteins involved in signal transduction and interacts directly 
with c-jun N-terminal kinase 1 (JNK1) 37 . Although ak has not 
been shown to have a role lung cancer, its role in the MAP-ki- 
nase pathway, which leads to activation of matrix metallopro- 
teinase secretion and cell invasion 38 , indicates potential 
involvement in the the tumor cell invasion or metastasis of 
some lung adenocarcinomas. Among the many genes identified ' 
in this study, like ak, that may be causally involved in lung can- 
cer progression (Table 1), some were related to survival in many 
patients, and others in only smaller subsets of patients. This re- 
sult is consistent with the complex molecular architecture of tu- 
mors in general, the heterogeneity of lung adenocarcinomas in 
particular and the multiple mechanisms underlying tumor-cell 
survival, invasion and metastasis 3 '. 

Our results demonstrate that a gene- ex press ion risk profile- 
based on the genes most associated with patient survival — can 
distinguish stage 1 lung adenocarcinomas and differentiate prog- 
noses. The particular genes that define the clusters, or are associ- 
ated with survival, likely reflect the characteristics of the 
particular tumors included in the analysis. Current therapy for 
patients with stage I disease usually consists of surgical resection 
without adjuvant treatment 23 . Clearly, the identification of a 
high-risk group among patients with stage J disease would lead 
to consideration of additional therapeutic intervention for this 
group, possibly leading to improved survival of these patients. 

Methods 

Patient population. Sequential patients seen at the University of Michigan 
Hospital between May 1994 and July 2000 lor stage I or stage [ft lung ade- 
nocarcinoma weie evaluated for this study. Consent was received and the 
project was approved by the local Institutional Review Board. Primary tu- 
mors and adjacent non neoplastic lung tissue were obtained at the lime ol 
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surgery. Peripheral portions of resected lung carcinomas were sectioned, 
evaluated by a study pathologist and compared with routine H&E sections 
of the same tumors, and utilized for mRNA isolation. Regions chosen for 
analysis contained a tumor. cellular ity greater than 70%, no mixed histol- 
ogy, potential metastatic origin, extensive lymphocytic infiltration or fibro- 
sis. Tumors were histopathologicalry divided into two categories based on 
their growth patternf bronchial-derived, if they exhibited invasive features 
with architectural destruction, and bronchioloafveplar, if they exhibited 
preservation of the lung architecture. AH stage I patients received only sur- 
gical resection with intra thoracic nodal sampling and no other treatments. 
Stage. Ill patients received surgical resection plus chemotherapy and radio- 
therapy. 

Gene-expression profiling and K-ros mutation analysis. RNA isolation, 
cRNA synthesis and gene-expression profiling were performed as de- . 
. scribed 14 . Details of gene annotation and K-ros mutation analysis are pro- 
vided in supplementary information. 

Northern-blot analysis. Total cellular RNA (1 0 jag) was separated in 1 .2% 
agarose-formaldehyde gels and vacuum-transferred to Gene Screen Plus 
(NEN Life Science Products, Boston, Massachusetts). Hybridization condi- 
tions and probe labeling were as described 40 . Individual sequence-validated 
cDNA image clones for human ICFBPS (clone 1407750), LDH A (clone 
2420241), cyslatin C (CTS3; clone 949938) were from Research Genetics 
(Huntsville, Alabama). The human histone H4 cDNA and the 28$ ribosomal 
RNA 26-mer oligonucleotide probe were prepared and labeled as de- 
scribed 40 . 

Gene-amplification analysis. 1 1 genes were selected for the analysis of ge- 
nomic alterations. Primers were designed using PrimerSelect 4.05 Windows 
32 software (DNASTAR, Madison, Wisconsin), avoiding pseudogenes or po- 
tential homologous regions. Forward and reverse primers for the genes are 
provided (Supplementary Methods online). Quantitative genomic- PCR was 
then applied and analyzed as described 41 . 

Immunohistochemical staining. The H&E-stained sfides of all primary 
lung tumors were used to identify the most representative regions of each 
tumor and a tissue microarray (TMA) block was constructed as described 43 . 
ImmunoNstocbemistry (1HC) was performed using both routine and sec- 
tions (rom the TMA block as described 3 *. Detailed methods and the con- 
centrations used for all antibodies are provided in the Supplementary 
Methods. 

Statistical methods. Mests were used to identify differences in mean gene- 
expression levels between comparison groups. Agglomerative hierarchical 
clustering 41 was applied using the average linkage method to investigate 
whether there was evidence for natural groupings of tumor samples based 
on correlations between gene-expression profiles. To investigate the ro- 
bustness of the clustering inference, gene- expression values were per- 
turbed by adding random Gaussian error of magnitude obtained from a 
duplicate sample to each" data point and then reclustered to determine con- 
cordance in the tumor's class membership. Pearson, x 1 3r, d fisher's exact 
tests were used to assess whether cluster membership was associated with 
physical and genetic characteristics of the tumors. 

7o determine whether gene-expression profiles were associated with 
variability in survival times, 2 separate but complementary approaches 
were used. In the first approach, the 86 tumors were randomly assigned to 
equivalent tiaining and testing sets consisting o( equal numbers of stage I 
and lit tumors in order to validate a novel risk-index function that captured 
the effect of many genes at once ; In the second approach, cross- validation" 
was used to more robustly identify the genes associated with survival. 
Briefly, a 'leave- one- out' cross-validation procedure in which 85 ot the 86 
tumors (the training set) was used to identify genes that were univarialely 
associated with survival. The risk index was defined as a linear combination 
of the gene- expression values for the top genes identified by univariate Cox 
proportional- hazard regression modeling* 1 , weighted by their estimated re- 
gression coefficients. Kaplan-Meier survival plots and Ion-rank tests were 
then used to assess whether the risk-index assignment to high/low cate- 
gories was validated in the test set. A more detailed description is provided 
(Supplementary Methods online). 



Note: Supplementary information h ovoilable on the Nature Medicine website. 
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Rapid quantitation of proinflammatory and chemoattractant cytokine 
expression in small tissue samples and monocyte-derived dendritic cells: 
validation of a new real-time RT-PCR technology. 

Blaschke V, Reich K . Blaschkc S. Zipprich S , Neumann C . 

Department of Dermatology, von-SieboId-Str. 3, D-37075, Goettingen, Germany. 
vblasch@gwdg!de 

The analysis of cytokine profiles plays a central part in the characterization of disease- 
related inflammatory pathways and the identification of functional properties of immune 
cell subpopulations. Because tissue biopsy samples are too small to allow the detection of 
cytokine protein, the detection of mRNA by RT-PCR analysis is often used to investigate 
the cytokine milieu in inflammatory lesions. RT-PCR itself-is a qualitative method, 
indicating the presence or absence of specific transcripts. With the use of internal or 
external standards it may also serve as a quantitative method. The most widely accepted 
method is quantitative competitive RT-PCR, based on internal shortened standards. 
Recently, online real-time PCR has been introduced (LightCycler), which allows 
quantitation in less than 30 min. Here, we have tested its use for the analysis of cytokine . 
gene expression in different experimental in vitro and ex vivo settings. First, we 
compared quantitative competitive RT-PCR with real-time RT-PCR in the quantitation of 
transcription levels of the CD4(+) cell-specific chemoattractant Interleukin- 1 6 during the 
maturation of monocyte-derived dendritic cells, and found a good correlation between 
both methods. Second, differences in the amounts of IL-16 mRNA in synovial tissue 
from patients with rheumatoid arthritis and osteoarthritis as assessed by real-time RT- . 
PCR paralleled differences in the level of IL-16 protein in the synovial fluid. Finally, we 
employed real-time RT-PCR to study the cutaneous expression of several cytokines 
during experimental immunomodulatory therapy of psoriasis by Interleukin- 10, and 
demonstrate that the technique is suitable for pharmacogenomic monitoring. In summary, 
real-time RT-PCR is a sensitive and rapid tool for quantifying mRNA expression even 
with small quantities of tissue. The results obtained do not differ from those generated hy 
quantitative competitive RT-PCR. 

Publication Types: 

• Evaluation Studies i 
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Butyrate-induced reversal of dexarnethasone resistance in autonomous rat 
Nb2 lymphoma cells. 

Buckley AR, Krumenackei JS, Buckley DJ . LeffMA , Magnusou NS . Reed JC 
Miyashita T . dc Jong G . Gout PW ; ' 

Department of Pharmacology and Toxicology, University of North Dakota School of 
Medicine and Health Sciences, Grand Forks 58202-9037, USA. 
abuckIey@maiLm6d.und.nodak.edu 

The parental rat Nb2 lymphoma is a prolactin (PRL)-dependent T cell line. Exposure of a 
PRL-independent subline,. Nb2-SFJCD I , to sodium butyrate (NaBT) causes transient 
reversal of their growth factor-independent proliferation in association with constitutive 
expression of protooncogenes pim-1 and c-myc. In the present study, we investigated the 
effect of NaBT treatment on the sensitivity of Nb2-SFJCD1 cells to dexamethasone 
(DEX)-induced apoptosis. Pretreatment with NaBT (2 mM, 72 h) partially reversed 
resistance to apoptosis in Nb2-SFJCD1 cells exposed to DEX (100 nM) for 12 h, 
assessed by flow cytometric analyses of DNA fragmentation. However, the cytolytic 
effect of DEX was abrogated by PRL in a time- and concentration-dependent manner. 
Evaluation of apoptosis-associated gene expression in NaBT-pre-treated cultures 
incubated with DEX or DEX+PRL indicated that the apoptosis resistance did not stem 
from altered bcl-2 or bax expression. However, there was a strong correlation between 
die resistance to DEX-activated apoptosis and their enhanced expression of pim-1 mRNA 
and protein. The results show that it is possible to reverse DEX-induced apoptosis of Nb2 
pre-T cells and suggest the pim-1 gene product has an important role as a suppressor of 
this process, perhaps functioning as a mediator of PRL action. 
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Alterations in neuropeptide Y levels and Yl binding sites in the Flinders 
Sensitive Line rats, a genetic animal model of depression. 

Caberlotto L , Jimenez P. Overstreet PH . Hard YL, Mathe AA , Fuxe K . 

Department of Neuroscience, Karolinska Institute, Stockholm, Sweden. 

Previously, we observed specific alterations of neuropeptide Y (NPY) and Yl receptor 
mRNA expression in discrete regions of the Flinders Sensitive Line rats (FSL), an animal 
model of depression. In order to clarify the correlation between mRNA expression and 
protein content, radioimmunoassay and receptor autoradiography were currently 
performed. In the FSL rats, NPY-like immunoreactivity (NPY-LI) was decreased in the 
. hippocampal CA region, while Yl binding sites were increased; NPY-LI was increased 
in the arcuate nucleus. Fluoxetine treatment elevated NPY-LI in the arcuate and anterior 
cingulate cortex and increased Yl binding sites in the medial amygdala and occipital 
cortex in both strains. No differences were found regarding the Y2 binding sites. The 
results demonstrate a good correlation between NPY peptide and mRNA expression, and 
sustain the possible involvement of NPY and Yl receptors in depression. 
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Neurokinin 1 receptor and relative abundance of the short and long 
isoforrns in the human brain. 

Cabcrlotto L , Hm d YL , Murdock P t Wahlin JP , Mclotto S , Corsi M , Carlctti R . 

Department of Biology, Psychiatry CEDD, GlaxoSmithKIine Medicine Research Centre, 
Verona, Italy. LauraX.Caberlotto@gsk.com 

Substance P exerts its various biochemical effects mainly via interactions through 
neurokinin^ I receptors (NKl). Recently, the NKl receptor has attracted considerable 
interest for its possible role in a variety of psychiatric disorders including depression and 
anxiety. However, little is known regarding the anatomical distribution of NKl in the 
human central nervous system (CNS). Riboprobe in situ hybridization, quantitative PCR 
and in vitro autoradiography were performed. Highest NKl mRNA levels were localized 
in the locus coeruleus and ventral striatum, while moderate hybridization signals were 
observed in the cerebral cortex (most abundant in the visual cortex), hippocampus and 
different amygdaloid nuclei. Very low levels of the NKl mRNA were detected in the 
cerebellum and thalamus. In view of the existence of a long and short isoform of the NKl 
receptor, it was of interest to assess whether there was a differential distribution of the 
two splice variants in the human CNS and peripheral tissues. A quantitative TaqMan 
PCR analysis showed that the long NKl isoform was the most prevalent throughout the 
human brain, while in peripheral tissues the truncated form was the most represented. 
3H-Substance P. autoradiography revealed a good correlation between receptor binding 
sites and NKl mRNA expression throughout the brain, with the highest levels of binding 
in the locus coeruleus. These results provide the anatomical evidence that the NKl 
receptors have a strong association with neuronal systems relevant to mood regulation 
and stress in the human brain, but do not suggest a region-specific role of the two 
isoforrns in the CNS. 
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Characterization of cyclin D2 expression in human endometrium. 

ChoiJ), Yoon S, Lcc E , Hwang S . SongS , Kim J , Yoon BK. LeeJH . 

. Department of Obstetrics and Gynecology, Samsung Medical Center, Sungkyunkwan 
University School of Medicine, Seoul, South Korea, dschoi@smc.samsung.co.kr 

OBJECTIVE: This study was undertaken to investigate cyclin D2 mRNA and protein 
expression in human endometrium during the menstrual cycle. METHODS: Endometrial 
samples were obtained from 15 premenopausal nonpregnant women who had 
hysterectomies for benign gynecologic reasons. They were divided into the following five 
groups according to histologic dating: early proliferative (n - 3), mid to late proliferative 
(n = 3), early secretory (n « 3), mid secretory (n = 3), and late secretory (n - 3). Cyclin 
D2 mRNA and protein expression were analyzed using reverse transcriptase-polymerase 
chain reaction, Western blotting, and immunohistochemistry. RESULTS: Cyclin D2 
mRNA and protein were expressed in human endometrial tissue throughout the menstrual 
cycle. Cyclin D2 mRNA and protein expression of proliferative phase endometrium were 
significantly higher than those of secretory phase endometrium (P <.05). The staining 
intensity of cyclin D2 in proliferative phase endometrium was higher than that in 
secretory phase (P <.05). Cyclin D2 mRNA level showed good correlation with cyclin 
D2 protein level (R = 0.579, P <.03), and cyclin D2 protein also showed good correlation 
with immunohistochemical staining intensity (R = 0.562, P <.03). CONCLUSION: 
Cyclin D2 was expressed in human endometrium throughout the menstrual cycle. Cyclin 
D2 mRNA and protein were expressed at high levels in proliferative phase endometrium, 
especially in the early proliferative phase, and then decreased in the secretory phase. 
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Human chorionic gonadotrophs beta expression in malignant Barrctt*s 
oesophagus. 

Convclard A, Paraf F , Vidaud D , Dubois S , Vidaud M Flejou JF , Degott C . 

Service d'Anatomie Pathologique, Hopital Beaujon, 921 18 Clichy cedex, France.. 
amexouvelard@bjn.ap-hop-paris.fr 

BACKGROUND: Human chorionic gonadotropin beta (hCGbeta) is expressed in 
several non-trophoblastic tumours, and this is usually associated with aggressive 
behaviour.. Little is known about hCGbeta expression in Barrett's adenocarcinoma. 
MATERIALS AND METHODS: We determined the hCGbeta profile in a large series of 
surgically resected Barrett's adenocarcinoma (a) at mRNA level using real-time 
quantitative reverse- transcription polymerase chain reaction analysis and (b) at protein 
level using immunohistochemistry with a polyclonal antibody and with a monoclonal 
antibody specific for free hCGbeta. We then sought links between the hCGbeta protein 
expression pattern and clinical and pathological parameters, including patient outcome as 
well as vascular endothelial growth factor (VEGF) expression. RESULTS: hCGbeta 
protein expression was observed in 43 of 76 (57%) Barrett's adenocarcinomas. We 
showed a strong correlation between hCGbeta protein abundance and CGB mRNA level. 
We observed a statistical link between hCGbeta protein expression and infiltrative 
tumour type ( P-0.023), perineural neoplastic invasion ( P=0.007) and VEGF protein 
expression ( P=0.016). hCGbeta expression tended to be associated with a poor outcome 
(16% versus 36% survival 8 years after resection). CONCLUSION: Expression of 
hCGbeta correlates with specific infiltrative characteristics and is associated with higher 
VEGF expression. Bodi molecules may play a co-ordinated role in the development of 
Barrett's adenocarcinomas. 
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Downregulation of ENaC activity and expression by TNF-alpha in alveolar 
epithelial cells. 

Dagenais A, Frechette R , Yamagata Y . Yamagata T . CarmelJF , Clermoat ME , 
Brochiero E . Masse C , Berthtaumc Y . 

Centre de recherche, CHUM-Hotel-Dieu, 3850 St-Urbain, Montreal, Quebec, Canada 
H2W 1T7. andre.dagenais.chum@ssss.gouv.qc.ca 

Sodium absorption by an amiloride-sensitive channel is the main driving force of lung 
liquid clearance at birth and lung edema clearance in adulthood. In this study, we tested 
whether tumor necrosis factor-alpha (TNF-alpha), a proinflammatory cytokine involved 
in several lung pathologies, could modulate sodium absorption in cultured alveolar 
epithelial cells. We found that TNF-alpha decreased the expression of the alpha-, beta-, 
and gamma-subunits of epithelial sodium channel (ENaC) mRNA to 36, 43, and 16% of 
the controls after 24-h treatment and reduced to 50% the amount of alpha-ENaC protein 
in these ceils. There was no impact, however, on alpha(l) and beta(l) Na(-f)-K(+)- 
ATPase mRNA expression. Amiloride-sensitive current and ouabatn-sensitive Rb(+) 
uptake were reduced, respectively, to 28 and 39% of the controls. A strong correlation 
was found at different TNF-alpha concentrations between the decrease of amiloride- 
sensitive current and alpha-ENaC mRNA expression. All these data show that TNF- 
alpha, a proinflammatory cytokine present during lung infection, has a profound 
influence on the capacity of alveolar epithelial cells to transport sodium. 
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Inhibin aud activiu production and subunit expression in bumau placental 
cells cultured in vitro. 

Debieve Pampfer S , Thomas EC 

Department of Obstetrics and Gynecological Endocrinology, Universite Catholiquede 
Louvain, 1200 Brussels, Belgium. 

Inhibins and activins are dimeric proteins, with each subunit being one of three related 
protein subunits (alpha, betaA or betaB). The mRNA levels of these subunits were 
studied quantitatively during in- vitro differentiation of human cytotrophoblast cells into 
syncytium, using Northern blot analysis and semi-quantitative reverse transcription- 
polymerase chain reaction (RT-PCR) analysis. The corresponding protein concentrations 
were determined by specific enzyme-linked immunosorbent assays for inhibin A, B, pro 
alphaC and activin A in cellular protein extracts and culture medium (n = 5). 
Immunofluorescence studies showed syncytium formation after 48 h. The alpha subunit 
was present before plating and increased at 48 h (PO.001) while the betaA subunit was 
weak before plating and increased at 24 h. The betaB subunit was not detected. With 
respect to corresponding protein synthesis, inhibin A (alpha + betaA) had risen after 48 h 
in cellular protein extract and after 72 h in culture medium, while activin A (betaA + 
betaB) was detected after 24 h, with no significant variations in culture medium. There 
was a good correlation between inhibin A and alpha subunit expression (r = 0.736, 
PO.001), as well as between activin A and betaA subunit expression (r = 0,755, 
PO.001). This study showed that mRNA expression parallels protein synthesis of inhibin 
and activin in trophoblast cells. Inhibin A synthesis appears to be dependent on alpha 
subunit mRNA expression, rather than on the betaA subunit which controls activin A 
synthesis. This study has also shown that isolated cytotrophoblast cells do not produce 
dimeric inhibin. However, during the transformation of cytotrophoblast cells into 
syncytium, betaA subunit mRNA expression may be an indicator of cell aggregation, 
while alpha subunit mRNA expression may be an indicator of cell fusion. 
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Involvement of the CCND1 gene in hairy cell leukemia. 

dc Boer CJ , Kluin-Nelemaris JC Dreef E , Kestcr MG , Khiin PM Schmiring E. vaa 
Kriekcn JH . 

Department of Pathology, University of Leiden, The Netherlands. 

BACKGROUND: Previous results suggested increased mRNA expression of CCND 1 in 
hairy cell leukemia (HCL). The CCND1 gene is involved in the t(l I;14)(ql3;q32) 
chromosomal rearrangement, a characteristic abnormality in mantle cell lymphoma 
(MCL). We and others reported that, in contrast to other B-cell lymphomas, almost all 
MCL have over-expression of the CCND1 gene with a good correlation between RNA 
and protein analysis. Recent studies showed that overexpression of the cyclin D|l protein 
can be easily detected by uiimunohistochemistry (IHC) on formalin-fixed, paraffin 
embedded tissues. PATIENTS AND METHODS: To investigate whether the CCND1 
gene is involved in HCL, we performed IHC on a series of 22 cases using formalin-fixed 
paraffin embedded splenectomy specimens. For IHC the sections were boiled in citrate 
buffer. The presence of rearrangements within the BCL-l locus and the CCND1 gene 
was analyzed in 13 of 22 cases by Southern blot analysis using all available break-point 
probes. Expression of CCND 1 was analyzed at the mRNA level (Northern blot) and 
protein level (IHC). RESULTS: Overexpression of the cyclin Dl protein using IHC was 
observed in all cases, with strong expression in 5 cases. Pre-existing B- and T-cell areas 
of the spleen did not express significant levels of the cyclin Dl protein. Seven of 9 cases 
analyzed by both IHC and Northern blotting showed overexpression of the CCND1 gene 
with both methods. No genomic abnormalities were observed in any of the 13 cases 
studied by Southern blot analysis. Additionally, no I lql3 abnormalities were detected by 
banding analysis of 19 of 22 cases. CONCLUSIONS: The elevated levels of CCND1 
mRNA and protein in conjunction with the absence of overt rearrangements within the 
BCL-l locus distinguish HCL from MCL and other B-cell malignancies. This suggests 
that activation of the CCND I gene in HCL is due to mechanisms other than chromosomal 
rearrangement. 
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Expression of membrane-type matrix metalloproteinases 4, 5, and 6 in 
mouse corneas infected with P. aeruginosa. 

Dong Z , Katar M , Alousi S , Berk RS . 

Department of Immunology and Microbiology, Wayne State University School of 
Medicine, 540 E. Canfield, Detroit, Mr 48201, USA. 

PURPOSE: To investigate the expression and regulation of membrane-type matrix 
metalloproteinases (MT-MMPs) 4, 5, and 6 in the mouse corneas infected with 
Pseudomonas aeruginosa. METHODS: C57BL/6J mice were intracorneally infected with 
P. aeruginosa. The expression of MT4-, MT5-, and MT6-MMP was detected at both the 
mRNA and protein levels by RT-PCR and immunobiot analysis. Immunohistochemicai 
staining was performed tojocalize the expression of MT4- and MT5-MMP in the mouse 
corneas. RESULTS: Expression.of MT4- and MT5-MMP was detected in the normal " 
(uninfected) cornea by RT-PCR and immunobiot analysis. When infected with P. 
aeruginosa, the corneas showed "significant induction of each MT-MMP. Localization of 
MT4- and MT5-MMP revealed that the expression of MT5-MMP was restricted to the 
epithelial tissue in the normal cornea, whereas the induced expression of MT4- and MT5- 
MMP was predominantly in the substantia propria, which contained most of the 
infiltrating cells. MT6-MMP expression was not detected in the uninfected cornea but 
was upregulated in the infected corneas. CONCLUSIONS: Expression of MT4-, MT5-, 
and MT6-MMP was induced in corneas infected with P. aeruginosa. 
Immunohistochemistry showed. predominant immunoreactivity of MT4- and MT5-MMP 
in the substantia propria. Previous histologic studies have revealed different patterns of 
inflammatory cell infiltration with an increased number of polymorphonuclear 
neutrophils (PMNs) during the early stage of inflammation and increased macrophages 
during the late stage. These results indicate a good correlation between the 
overexpression of the MT-MMPs in the infected corneas and the inflammatory response- 
that is, leukocyte infiltration-indicating that inflammatory cells such as macrophages and 
PMNs may play a role in the upregulation of MT-MMPs during corneal infection, which 
in turn can cause the destruction of corneal tissue. 
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Assessment of proliferative activity in colorectal carcinomas by 
quantitative reverse transcriptase-polymerase chain reaction (RT-PCR), 

Duchrow M llascmcver S . Broil R , Bruch Ul\ Windhovcl U . 

Surgical Research Laboratory, Surgical Clinic, Medical University of Lubeck, 
Ratzeburger Allee 160, D.23538 Lubeck, Germany. 

The monoclonal antibody Ki-67 and the isospecific monoclonal antibody MiB-1 are 
routinely used in oncology to assess the proliferation index of tumor cells. A more 
objective and sensitive method is the determination of the of Ki-67 protein-specific 
.mRNA by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). In 25 
resected colorectal adenocarcinomas of different stages and grades we determined 
between 0.2 and 4.4 amol (I0(-18) mof) Ki-67 protein-specific mRNA per microgram 
total RN A (median = 0.88 amol). The corresponding Ki-67 indices (expressing the 
percentage of Ki-67/MIB-I positive tumor cells) ranged from 41 to 81% (median = 61%). 
We found a good correlation between Ki-67 index and mRNA expression (r = 0.75), a 
significant con-elation between both data and tumor stage (primary tumor, regional nodes, 
metastasis [pTNM] staging classification) (p < 0.001), but not between both data and 
tumor grade. Both Ki-67 indices (p = 0.05) and mRNA levels (p = 0.014) correlated 
significantly to the patients' survival. These results demonstrate that the Ki-67 protein- 
specific quantitative RT-PCR is a useful method for the characterization of tumor cell 
proliferation. 



PMtD: 1 1486701 [PubMcd - indexed for MEDLINE] 



142: Equme Vet J. 2002 Jul;34(4):349-58. Related Articles, Links 

Comment in: 

• Equine Vet J. 2002 Jul:34(4V.326-7 . 

Molecular characterisation of carbohydrate digestion and absorption in 
equine small intestine. 

Pypr JL Fernandcz-Castano Mercdiz E . Salmon ICS , Proudman CJ , Edwards GB . 
Sltirazi-Becchcy SP . 

Department of Veterinary Preclinical Sciences, University of Liverpool, UlC. 

Dietary carbohydrates, when digested and. absorbed in the small intestine of the horse, 
provide a substantial fraction of metabolisable energy. However, if levels in diets exceed 
the capacity of the equine small intestine to digest and absorb them, they reach the 
hindgut, cause alterations in microbial populations and the metabolite products and 
predispose the horse to gastrointestinal diseases. We set out to determine, at the 
molecular level, the mechanisms, properties and the site of expression of carbohydrate 
digestive and absorptive functions of the equine small intestinal brush-border membrane. 
We have demonstrated that the disaccharidases sucrase, lactase and maltase are expressed 
diversely along the length of the intestine and D-glucose is transported across the equine 
intestinal brush-border membrane by a high affinity, low capacity, Na+/glucose 
cotransporter type I isoform (SGLT1). The highest rate of transport is in duodenum > 
jejunum > ileum. We have cloned and sequenced the cDNA encoding equine SGLTl and 
alignment widi SGLTl of other species indicates 85-89% homology at the nucleotide and 
84-87% identity at die amino acid levels. We have shown that there is a good correlation 
between levels of functional SGLTl protein and SGLTl iriRNA abundance along the 
length of the small intestine. This indicates that the major site of glucose absorption in 
horses maintained on conventional grass-based diets is in the proximal intestine, and the 
expression of equine intestinal SGLTl along the proximal to distal axis of the intestine is 
regulated at the level of mRNA abundance. The data presented in this paper are the first 
to provide information on the capacity of the equine intestine to digest and absorb soluble 
carbohydrates and has implications for a better feed management, pharmaceutical 
intervention and for dietary supplementation in horses following intestinal resection. 
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Suppressors of cytokine signaling proteins are differentially expressed in 
Thl and Th2 cells: implications for Th cell lineage commitment and 
maintenance. 



Laboratory of Immunology, National Eye Institute, National Institutes of Health, 
Bethesda, MD 20892, USA. emeka@helix.nili.gov 

Positive regulatory factors induced by IL-12/STAT4 and IL-4/STAT6 signaling during T 
cell development contribute to polarized patterns of cytokine expression manifested by 
differentiated Th cells. These two critical and antagonistic signaling pathways are under- 
negative feedback regulation by a multimember family of intracellular proteins called 
suppressor of cytokine signaling (SOCS). However, it is not known whether these 
negative regulatory factors also modulate Thl /Th2 lineage commitment and maintenance. 
We show here that CD4(» naive T cells constitutively express low levels of SQCS1, 
SOCS2, and SOCS3 mRNAs. These mRNAs and their proteins increase significantly in 
nonpolarized Th cells after activation by TCR signaling.We further show that 
differentiation into Thl or Th2 phenotype is accompanied by preferential expression of 
distinct SOCS mRNA transcripts and proteins. SOCS1 expression is 5-fold higher in Thl 
than in Th2 cells, whereas Th2 cells contain 23-fold higher levels of SGCS3. We also 
demonstrate that IL-12-induced STAT4 activation is inhibited in Th2 cells that express 
high levels of SOCS3 whereas IL-4/STAT6 signaling is constitutively activated in Th2 
cells, but not Till cells, with high SOCS1 expression. These results suggest that mutually 
exclusive use of STAT4 and STAT6 signaling pathways by differentiated Th cells may 
derive in part, from SOCS3- or SOCS I -mediated repression of IL-12/STAT4- or IL- 
4/STAT6 signaling in Th2 and Thl cells, respectively. Given the strong correlation 
between distinct patterns of SOCS expression and differentiation into the Thl orTh2 
phenotype, SOCS1 and SOCS3 proteins are therefore Th lineage markers that can serve 
as therapeutic targets for immune modulation therapy! 
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Intravitreal invading cells contribute to vitreal cytokine milieu in 
proliferative vitreoretinopathy. 

EI-Ghrablv IA , DuaHS 7 Orr GM Fischer R Tigbe PJ . 

Larry A Donoso Laboratory for Eye Research, Department of Ophthalmology, University 
of Nottingham , UK. 

AIM: To examine the contribution of infiltrating cells in the local production of cytokines 
within the vitreous of patients with proliferative vitreoretinopathy (PVR) ; METHODS: 
The presence of mRNA coding for IL-6, IL-8, IL-lbeta, IL-1 alpha, TNFalpha, 
IFNgamma, IL-12, and HPRT was investigated in 25 vitreous samples from patients with 
PVR, 1 1 vitreous samples from patients with retinal detachment (RD) not complicated by 
PVR, and 10 vitreous samples from patients with macular hole (MH). A quantitative 
reverse transcriptase polymerase chain reaction (RT-PCR) using an internal competitor 
was used to investigate these samples. From these samples, 15 PVR, 8 RD, and 8 MH 
were analysed for the' protein levels of the same cytokines using enzyme linked 
immunosorbent assay (ELISA). Spearman correlation was used to test any association 
between mRNA and cytokine protein levels^as an indicator of the contribution these cells 
make to the intravitreal cytokine milieu. RESULTS: A strong correlation was found 
between mRNA and their respective cytokine levels .{protein products) for IL-6, IL-8, IL- 
lbeta, IL-lalpha, TNFalpha, IFNgamma (Spearman r - 0.83, 0.73, 0.67, 0.91, 0.73, and 
0.73 respectively), but not for IL-12. The median levels of IL-6, 11^8, IL-lbeta, and 
IFNgamma mRNA and their respective cytokines were significantly higher (p O.05) in 
patients with PVR than in those with macular hole. There was no statistically significant 
difference in the medi an levels of IL- 1 alpha mRNA between PVR and MH but the 
cytokine IL-1 alpha was detected at a significantly higher level in PVR compared with 
MH patients. Between PVR and RD patients, there was no statistically significant 
difference in mRNA levels for all the investigated cytokines (p >0.05) except for IL-6 
where there was a statistical significance (p= 0.038). In contrast, the median levels of IL- 
• 6, IL-8, and IL-lbeta cytokines were significantly higher (p <0.05) in patients with PVR 
than in those with RD, whereas for IL-1 alpha and IFNgamma no significant statistical 
difference was detected between PVR and RD patients (p >0.05). When results of RD 
and MH patients were compared, a statistical difference was only detected in mRNA 
levels of INFgamma (p = 0.008). However, no difference was detected for INFgamma 
(protein product) or for any of the other cytokines between RD and MH patients. 
CONCLUSION: Levels of both protein and mRNA encoding IL-6, IL-8, IL-lbeta, and 
IFNgamma is significantly increased in vitreous samples from patients with PVR. The 
strong correlation between ELISA detectable cytokines (protein products) and their 
respective mRNA levels suggest that intravitreal, invasive cells are the major source of 
these cytokines, with the exception of IL-12. Cells invading the vitreous do not appear to 
locally produce IL-12 mRNA. This would appear to implicate cells peripheral to the 




vitreal mass as the major source of this cytokine. 
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Altered levels of scavenging enzymes in embryos subjected to a diabetic 
environment. 

Forsberg H , Borg_LA, CaglieroE, Eriksson tJJ . 

Department of Medical Cell Biology, University of Uppsala, Sweden. 

Maternal diabetes during pregnancy is associated with an increased rate of congenital 
malformations m the offspring. The exact molecular etiology of the disturbed 
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in vivo, or h.gh glucose in vitro alters the expression of the free oxygen radical 

scavenging enzymes superoxide dismutase (CuZnSOD and MnSOD), catalase and 
glutathione peroxidase in rat embryos during late organogenesis. We studied offspring of 
normal and diabetic rats on gestational days 1 1 and 12, and also evaluated day-1 1 
embryos after a 48 hour culture period in 10 mM or 50 mM glucose concentration. Both 
maternal diabetes and high glucose culture caused growth retardation and increased rate 
ot congenital malformations in the embryos. The CuZnSOD and MnSOD enzymes were 
expressed on gestational day 1 1 and both CuZnSOD, MnSOD and catalase were 
expressed on day 1 2 with increased concentrations of MnSOD transcripts when 
challenged by a diabetic milieu. There was a good correlation between mRNA protein 
and activity levels, suggesting that the regulation of these enzymes occurs primarily at 'the 
pretranslational level. Maternal diabetes in vivo and high glucose concentration in vitro 
induced increased MnSOD expression, concomitant with increased total SOD activity 
and a tentative decrease in catalase expression and activity in the embryos. These 
findings support the notion of enhanced oxidative stress in the embryo as an etiologic 
agent in diabetic feratogenesis. icuuiogic 
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Induction of the estrogen receptor by growth hormone and glucocorticoid 
substitution in primary cultures of rat hepatocytes. 

Freyschuss B. Stavreus-Evcrs A, Sahlin L Eriksson ft . 

Department of Reproductive Endocrinology, Karoiinska Hospital, Stockholm, Sweden. 

Hepatic estrogen receptors (ER) mediate estrogenic effects on mammalian liver 
metabolism and are thereby involved in the regulation of important 
physiological/pathological processes, such as coagulation, atherosclerosis and 
hypertension. The regulation of the formation of the ER in primary cultures of rat 
hepatocytes was studied by assaying ER and ER rnRNA under different endocrine 
conditions. The ER concentration was measured using two different methods, a ligand- 
bindmg technique and an ER enzyme immunoassay. The results obtained by the two 
methods showed good correlation, and linear regression analysis gave a correlation 
coefficient of 0.95. ER concentrations fell to low steady state levels within 16 h after 
establishing the cell culture and remained low in the absence of hormonal substitution 
Upon medium supplementation with pituitary GH and the glucocorticoid dexamethasorie 
(Ub.K) in combination, the ER concentration increased 6-fold from 4 2 +/- 1 0 to 25 8 +/ 
7.0 finol/mg cytosolic protein. ER rnRNA was measured by solution hybridization ' 
Substitution with GH and DEX in combination increased ER mRNA to 210 +/- 14% 0 f 
control levels. No effect on ER mRNA stability was seen after hormone treatment It is 
concluded that the regulatory effects of GH and DEX on the hepatic ER in this in vitro 
system are very similar to the effects of these hormones under in vivo conditions The 
inducible expression of the ER has never before, to our knowledge, been demonstrated in 
any mammalian liver cell culture system. 

PMID: 8404593 [PubMed - indexed for MEDLINE] 
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Oxytocin (OT) receptor (OTR) concentrations were determined in the cervix of 

was used as the labeled hgand. Mucosal and muscle layers of the cervix were al o 
analyzed separately for both ligand binding and expression of the OTRgene us Z a 

ZLt : vd0p ?™ as ° *"T Cti0n a3Say (RAP) " °* lM! * loca,i ^- of OTR protein 
was determmed by immunohistochemistry. All regions of cervix from cows at had 
hxgh concentrahons of OTR; in the luteal phase, all were sharply down-regulate^At 

Jmn^Tr 1 h f 3b0Ut 3 °- f0,d Wgher -ncentraSn" than the^cfe la^er. 
OTR mRNA was readily detected by RAP in the mucosa from estrous cows while much 
weaker signals were found in the muscle. On Days 7-17, the OTR mRNA £S tabS 
mucosa and muscle were very faint or nondetectable. Thus, there was a goXSon 

^S^^rl^ eXPreSSi ° n ' WhiCh ^ that OTR cVnSntotions 
are mamly regulated at the transcriptional level. The epithelial cells at the luminal surface 

P t C L Pal ° f imm ^^ive OTR; muscle cells sh^eT 

EfooZ 7 ^ ! lgna,S - PrCVi ° USly ' ° T W3S f0Und to stimuIate Prostaglandin (PG) 
E2 output invite, m bovme cervical tissues. Since PGE2 is capable of softfning the 

^Zf^T^^ °J m 7 ^ 3 nOVd P^gUrf Action to cause 
sottenmg oi the bovine cervix mediated by the release of PGE2. 

PMID: 8835394 [PubMed indexed for MEDLINE] 



26: SwS* Fcb;,S(l,:l5 - 20 ' "«°>- - 



Silencing of the thrombomodulin geae in human malignant melanoma. 

FurutaJ, Kancda A , ymebavash. Y, Qtsuka F . S«.gimuraT. Ushiiinia T . 

Carcinogenesis Division, National Cancer Center Research Institute, Tokyo, Japan. 

The loss of thrombomodulin (TM) expression is associated with humour growth 
mhltoton and lymph node metastasis in human tumours. Tn melanoma cell lines TM is 
ported to meduUe cell adhesion, and its introduction into TM-negative melanoma eel 
Sanomr 568 ^ *^ - ^s study, we analysed TM expression insula 
melanoma specuneris and the role of its promoter mediation in the loss of its 
expression. In 15 (75%) of the 20 specimens (five from a primary site a^ 1 Is fern 
metastatic sUes), melanoma cells lacked TM immunoreacLity^ethylation of TM 
promoter region was detected in 10 (67%) of the 1 5 TM-negative specimens by 

7^XeZrlI 0lyn : eraSe Chain reaCti ° n ' WhCreaS meUi y lation was d <*^ ^ two 
Promoter cIg^'^T? fT™™' * "* Ration of the TM 

T !? T ted T s,x (46%) of 13 me,anoma ceU ,ines - wh —s - 

methylation was detected in two cultured normal melanocytes. There was a good 

(m f? A) ex P ression - Treatment of melanoma cell lines with a 
Tnr fl h g HT nt> 5 - aza - 2 '- deo *ycytidine, induced demethylation of the promoter 
CpG "land and the restoration of rnRNA and protein expression These findLTsuggest 

cl™C;Z^T ma T k ™ ** dl3t of the pilT 

«-p<j island is one of the mechanisms responsible. 
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A sampling of the yeast protcome. 

FutchcrB, Latter GI, Monardo P . McLaughlin CS . Garrets JT . 

Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 1 1 724, USA 
futcher@cshl.brg 

In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis 
and gathered quantitative information from about 1,400 spots. We found that there is an 
enormous range of protein abundance arid, for identified spots, a good correlation 
between protein abundance, mRNA abundance, and codon bias. For each molecule of 
well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was 
examined and found to be insignificant for abundant proteins. Some phosphoproteins 
were identified. The behavior of proteins in differential centrifugatibn experiments was 
examined. Such experiments with 2D gels can give a global view of the yeast proteome. 
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In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis and gathered quan- 
titative information from about 1,400 spots. We found that there is an enormous range of protein abundance 
and, for identified spots, a good correlation between protein abundance, mRNA abundance, and codon bias. 
For each molecule of well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was examined and found 
to be insignificant for abundant proteins. Some phosphoproleins were identified. The behavior of proteins in 
differential centrifugation experiments was examined. Such experiments with 2D gels can give a global view of 
the yeast proteome. 



The sequence of the yeast genome has been determined (9). 
More recently, the number of mRNA molecules for each ex- 
pressed gene has been measured (27, 30). The next logical level 
of analysis is that of the expressed set of proteins. "We have 
begun to analyze the yeast proteome bv using two-dimensional 
(2D) gels. 

2D gel electrophoresis separates proteins according to iso- 
electric point in one dimension and molecular weight in the 
other dimension (21), allowing resolution of thousands of pro- 
teins on a single gel. Although modern imaging and computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast pro- 
tein database consisting of a collection of identified spots in 2D 
gels and of data on each of these spots under various condi- 
tions (2, 1, 8, 10, 23, 25). These data could then be used in 
analyzing a protein or a metabolic process. Saccharomyces 
cerevisiae is a good organism for this approach since it has a 
well-understood physiology as well as a large number of mu- 
tants, and its genome has been sequenced. Given the sequence 
and the relative lack of in Irons in 5. cerewsiae, it is easy to 
predict the sequence of the primary protein product of most 
genes. This aids tremendously in identifying these proteins on 
2D gels. 

There are. three pillars on which such a database rests: (i) 
visualization of many protein spots simultaneously, (ii) quan- 
tification of the protein in each spot, and (iii) identification of 
the gene product for each spot. Our first efforts at visualization 
and identification for S. cerevisiae have been described else- 
where (7, 8). Here we desciibe quantitative data for these 
proteins under a variety of experimental conditions. 

MATERIALS AND MFIUODS 
Strains and media. S. cem>isuie W303 {M/tTn ade2-1 his3 JJ,J5 Ieu?-3, )]2 
trphl ura3- 1 can J-J0O) was used (26). - Mel YNB (yeast nitrogen base) medium - 
was 1.7 g of YNB (Difco) per litei. 5 g of ammonium sulfate per liter, and 
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adenine, uracil, and all amino acids except methionine; -Met.-Cys YNB me- 
dium was the same, bul without methionine or cysteine. Medium was supple- 
mented with 77c glucose (for most experiments) or with 2% ethanol (Cot ethanol 
experiments). Low-phosphate YEFD was described by Warner (28). 

Jsotopic labeling vf yeast :md preparation of cell extracts. Yeast strains weie 
labeled and proteins were extracted as described by Garrels et al. (7, 8). Briefly, 
cells were grown to 5 X 10* cells per ml. at 30°C; J ml of culture was transferred 
to a fresh tube, and 0J mCi of p 5 S]methionine (e.g., Express protein labeling 
mix; New England Nuclear) was added to this 1-ml culture. The cells were 
incubated for a further 10 to 15 mm and then transferred to a 1.5-mJ microcen- 
trifuge tube, chilled on ice, and harvested by centrifugation. The supernatant was 
removed, and the cell pellet was resitspended in 100 u.1 of lysis buffer (20 mM 
Tris-HCl [pH 7.6], 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA, 
0.1% deoxycholate: just before use, phenylmethylsulfonyl fluoride was added to 
J mM, leupeptin was added to I p-g/ml, pepstalio was added to ) u-g/ml tosyl- 
sulfonyl phenylalanyl chloromethyl ketone was added to 10 iig/ml, and soybean 
trypsin inhibitor was added to 10 ng/ml). 

The resuspended cells were transferred to a screw-cap 1.5-ml polypropylene 
tube containing 0.28 g of glass beads (0.5- mm dia merer; Biospec Products) or 
0.40 g ofzirconia beads (0.5- mm diameter; Biospec Products). After the cap was 
secured, the lube was inserted into a MiniBeadbeater 8 (Biospec Products) and 
shaken al medium high speed at 4°C for 1 min. Breakage was typically 75%. 
Tubes weTe then spun in a microcentrifuge for 10 $ at 5,000 X g at 4"C 

With a very fine pipette tip, liquid was withdrawn from the beads and trans- 
ferred to a prechilled 1.5-ml tube containing 7 uJ of DNase 1 (0.5 mg/rol; Cooper 
product no. 6330)-RNase A (0.25 mg/ml; Cooper product no. 5679)-Mg (50 mM 
MgCL) mix. Typically 70 ij.1 of liquid was recovered. The mixture was incubated 
on ice for 10 min to allow the RNase and DNase to work. 

Next, 75 til of 2 X dSDS (2x dSDS is 0.6% sodium dodecyl sulfate |SDS], 2% 
mercaptoethanol, and 0.1 M Tris-HCI [pH RJ) was added. The tube was plunged 
into boiling water, incubated for 1 min, and then plunged into ice. After cooling, 
the tube was ccnlrifuged at 4 6 C for 3 min at R000 X g> The supernatant was 
transferred to a fiesh tube and frozen at -70*C. About 5 ul of this supernatant 
was used for each 2D gel. 

2D pulyncryJnmide gels. 2D gels were made and run as described elsewhere 
(6-8). 

Image analysis of the gels. The Quest II software system was used for quan- 
titative image analysis (20, 22). Two techniques were used to collect quantitative 
data for analysis by Quest II software. First, before the advent of phospborim- 
agcrs, gels were dried and fluorographed. Each gel was exposed to film foj three 
different limes (typically 1 day, 2 weeks, and 6 weeks) to inn ease the dynamic 
range of the data. The films were scanned along with calibration stiips to relate 
film optical density to disintegrations per minute in the eels and analyzed by the 
software toobtain a linear relationship between disintegrations per minute in the 
spots and optical densities of the film images. The quantitative data are ex- 
pressed as parts per million of the tola) cellular protein. ITiis value is calculated 
from the disintegrations per minute of the sample loaded onto the gel and by 
comparing the film density of each data spot with density of the film over the 
calibialion strips of known radioactivity exposed to the same film. This yields the 
disiniegiaiions per minute per millimeter for e;tch spot on the gel and thence its 
parts- per- minute value. 

Aftei ihe advent of phosphor imaging, gels bearing ^S-labeled proteins were 
exposed lo phosphor imager screens and scanned by a Fuji phosphor imager, 
typically for two exposures per gel. Calibration snips of known radioactivity were 
exposed simultaneously. Scan data from the phosphoiimagei was assimilated by 
Quest I) software, and quantitative data were recorded foi the spots on the gels. 
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Measurements of protein turnover. Ccns in exponential phase were pulse- 
labeled with | 35 S)methionine, excess cold Met and Cys were added, and samples 
of equal volume were taken from the culture at intervals up to 90 min (in one 
experiment) or up to J60 min (in a second experiment). Incorporation of 35 S into 
protein was essentially 100% by the first sample (JO min). Extracts were made, 
and equal fractions of the samples were loaded on 2D gels (i.e., the different 
samples had different amounts of protein but equal amounts of 35 S). Spots were 
cmantitated with a phosphorimaging and Quest software. 

The software was queried for spots whose radioactivity decreased through the 
time course. The algorithm examined all data points for all spots, drew a best-fit 
line through the data points, and looked for spots where this line had a statis- 
tically significant negative slope. Jn one of the experiments, there was one such 
spot. To the eye, this was a minor, unidentified spot seen only in the first two 
samples (10 and 20 min). In the other experiment, the Quest software found no 
spots meeting the criteria. Therefore, we concluded that none of the identified 
spots (and all but one of the visible spots) represented proteins with long 
half-lives. 

Centrifugal fractionation. Cells were labeled, harvested, and broken with glass 
beads by the standard method described above except that no detergent (Le., no 
deoxycholate) was present in the lysis buffer. The crude h/sate was cleared of 
. unbroken cells and large debris by ccnlTifugation at 300 X g for 30 s. The 
supernatant of this centrifugal ion was then spun at 16,000 X g. for 10 min to give 
the pellet used for Fie. 6B. The supernatant of the 16,000 X g, JO-min spin was 
then spun at 100,000 X g for 30 min to give the supernatant used for Hg. 6A. 

Protein abundance calculations. A haploid yeast cell contains about 4 X* JO" 12 
g of protein (1, J5). Assuming a mean protein mass of 50 kDa, there are about 
50 X 10* molecules of protein per cell. There are about 1.8 methionines per 10 
kDa of protein mass, which implies 4.5 X 10 s molecules of methionine per cell 
(neglecting the small pool of free Mel). We measured (i) the counts per minute 
in each spot on the 2D gels, (ii) the total number of counts on each gel (by 
integrating counts over the entire gel), and (iii) the total number of counts 
loaded on the gel (by scintillation counting of the original sample). Thus, we 
know what fraction of the total incorporated radioactivity is present in each spot. 
After correcting for the methionine (and cysteine [see below]) content of each 
protein, we calculated an absolute number of protein molecules based on the 
fraction of radioactivity in each spot and on 50 X 10* total molecules per cell. 

The labeling mixture used contained about one-fifth as much radioactive 
cysteine as radioactive methionine. Therefore, the number of cysteine molecules 
per protein was also taken into account in calculating the number of molecules 
of protein, but Cys molecules were weighted one-fifth as heavilv as Met mole- 
cules. 

mRNA abundance calculations. For estimation of raRNA abundance, we used 
SAGE (serial analysis of gene expression) data (27) and Atfymetrix chip hybrid- 
ization data (29a r 30). The mRNA column in Table J shows mRNA abundance 
calculated from SAGE data alone. However, the SAGE data came from cells 
growing in YEPD medium, whereas our protein measurements were from cells 
growing in YNB medium. In addition, SAGE data for low- abundance mRNAs 
suffers from statistical vacation. Therefore, we also used chip hybridization data 
(29a, 30) foi mRNA from cells giown in YNB. These hybridization data also bad 
disadvantages. First, the amounts of high-abundance mRNAs were systemati- 
cally underestimated, probably because of saturation in the hybridizations, which 
used 10 iig of cRNA. For example, the abundance of ADHJ mRNA was 197 
copies per cell by SAGE but only 32 copies per cell by hybridization, and the 
abundance of END2 mRNA was 248 copies per cell by SAGE but only 41 by 
hybridization. When the amount of cRNA used in the hybridization was reduced 
to J ng> the apparent amounts of mRNA were similar to'thc amounts determined 
by SAGE (29a, 29b). However, experiments using 1 u.g of cRNA have been done 
for only some genes (29a). Because amounts of mRNA were normalized to 
15,000 per cell, and because the amounts of abundant mRNAs were underesti- 
mated, there is a 2.2-fold overestimate of. the abundance of nonabundant 
mRNAs. We calculated this factor of 22 by adding together the number of 
mRNA molecules from a laree number of genes expressed at a low level for both 
SAGE data and hybridization data. The sum for the same genes from hybrid- 
ization data is 2.2-fold greater than thai from SAGE data. 

To take into account these difficulties, we compiled a list ol "adjusted" mRNA 
abundance as follows. For all high- abundance mRNAs of our identified proteins, 
we used SAGE data For all of these particular mRNAs, chip hybridization 
suggested that mRNA abundance was the same in YEPD and YNB media. For 
medium-abundance mRNAs, SAGE data were used, but when hybridization 
data showed a significant difference between YEPD and YNB, then the SAGE 
data weie adjusted by the appropriate factor. Finally, for low- abundance 
mRNAs, we used data from chip hybridizations from YNB medium but divided 
by 2.2 to normalize to the SAGE results. These calculations weie completed 
without reference to piotein abundance. 

CAJ. The codon adaptation index (CA1) was taken from the yeast pro! come 
database ( YPD) ()3) r for which calculations were made according to Sharp and 
Lt (24). Biiefly, ihe index uses a reference set of highly expressed eenes to assign 
a value to each codon. and then a score for a gene is calculated from the 
frequency ol use of ihe various codons in that gene (241. 

Statistical analysis. The IMP program wjs used * j, n ihe aid of T. Tullv. The 
IMP program showed thai neither mRNA nor protein :ihundanccs were nor- 
mally riisiiibulcrd: therefore, Spearman rank con elation OK Hide ms (r.) wen* 
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calculated. The mRNA (adjusted and unadjusted) and protein, data were also 
transformed so that Pearson product-moment correlation coefficients (r ) could 
be calculated. First, this was done by a Box- Cox transformation of log- trans- 
formed data. This transformation produced normal distributions, and an r of 
0.76 was achieved. However, because the Box- Cox transformation is complex, we 
also did a simpler logariihmic transformation. This produced a normal distribu- 
tion for the protein data. However, the distribution for Ihe mRNA and adjusted 
mRNA data was close to, but not quite, normal. Nevertheless, we calculated the 
r p and found that it was 0.76, identical to the coefficient from the Box-Cox 
transformed data. We therefore believe thai this correlation coefficient is not 
misleading, despite the fact that the log(mRNA) distribution Is not quite normal. 



RESULTS 

Visualization of JylOO spots on three gel systems. Yeast 
proteins have isoelectric points ranging from 3.1 to 12.8, and 
masses ranging from less than 10 kDa to 470 kDa. It is difficult 
to examine all proteins on a single kind of gel, because a gel 
with the needed range in pJ and mass would give poor resolu- 
tion of the thousands of spots in the central region of the gel. 
Therefore, we have used three gel systems: (i) pH "4 to 8" with 
10% polyacrylamide; (ii) pH "3 to 10" with 10% poryacryl- 
amide; and (iii) nonequilibrium with 15% polyacrylamide (7, 
8). Each gel system allows good resolution of a subset of yeast 
proteins. 

Figure 1 shows a pH 4-8, 10% polyacrylamide gel. The pH 
at the basic end of the isoelectric focusing gel cannot be main- 
tained throughout focusing, and so the proteins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
these pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend the pi range somewhat 
beyond pH 15, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 15% 
acryjamide in the second dimension. These allow visualization 
of about 100 very basic proteins and about 170 small proteins 
(less lhan 20 KDa). In total, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 diiferent 
proteins, which is about one-quarter to one-third of the pro- 
teins expressed under these condiiions (27, 30). Here, we focus 
on the proteins seen on the pH 4-8 gels. 

Although nearly all expressed proteins are present on these 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many pro- 
teins comigrate or nearly comigrate. When two proteins are 
resolved, but are close together, and one protein spot is much 
more intense than the other, a problem arises in visualizing the 
weaker spot: at long exposures when the weak signal is strong . 
enough for detection, the signal from Ihe strong spot spreads 
and covers the signal from ihe weaker spot. Thus, weak spots 
can be seen only when they are well separated from strong 
spots. 

For a given gel, the number of delectable spots initially rises 
wiih exposure lime. However, beyond an optimal exposure, the 
number of distinguishable spots begins to decrease, because 
signals from strong spots cover signals from nearby weak spots. 
Al long exposures, Ihe whole autoradiogram turns black. Thus, 
there is an optimum exposure yielding ihe maximum number 
of spots, and at this exposure the weakest spots are not seen. 

Largely because of the problem of coverage, the proteins 
seen are strongly biased toward abundant proteins. All identi- 
fied proteins have a CA1 of 0.1 S or more, and we have iden- 
lifted no Iranscription factors or protein kinases, which arc 
nonabundant proteins. Thus, this technology is useful for ex- 
amining protein synthesis, amino acid metabolism, and glyco- 
lysis but not for examining transcription, DNA replication, or 
(he cell cycle. 




16kDa ^ 00t ^ 
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Spot identification. The identification of various spots has 
been described elsewhere (7, 8), At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25). The 
main methods used in spot identification have been analysis of 
amino acid composition, gene overexpression, peptide se- 
quencing, and mass spectrometry. 

Pulse-chase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein half-lives (Materi- 
als and Methods). Cells were labeled with [^methionine for 
10 min. and then an excess of unlabeled methionine was added. 
Samples were (alien at 0, 10, 20, 30, 60, and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were run, and spots were quantitated. 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of post- 
translational modifications (e.g., phosphorylation of RpaO and 
Efbl). Thus, the proteins being visualized are all or nearly all 
very stable proteins, with half-lives of more than 90 min. Gygi 
et ah (10) have come to a similar conclusion by using the N-end 
rule to predict protein half-Jives. This result does not imply 
that all yeast proteins are stable. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein quantitation. Because all of the proteins seen had 
effectively the same half-life, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine^ and the methionine and cysteine 
content of each identified protein, we could calculate the abun- 
dance of each identified protein (Tables 3 and 2; Materials and 
Methods). About 1,000 unidentified proteins were also quan- 
tified, assuming an average content of Mel and.Cys. 

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total protein amount. 
However, many proteins probably have minor spots that wc are 
not aware of, causing the amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had 8,904 ppm, the 10th 
most abundant had 2,842 ppm, the 100th most abundant had 
314 ppm, the 500th most abundant had 57 ppm, and the 
1,000th most abundant (visualized at greater than optimum 
exposure) had 23 ppm. Thus, there is more than a 300-fold 
range in abundance among the visualized proteins. The most 
abundant 10 proteins account for about 25% of the total pro- 
tein on the pH 4-8 gej, the most abundant 60 proteins account 
for 50%, and the most abundant 500 proteins account for 80%. 
Since it seems likely ihat the pH 4-8 gels give a representative 
sampling of all proteins, we estimate that half of the total 
cellular protein is accounted for by fewer than 100 different 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlation of protein abundance with mRNA abundance. 
Estimates of mRNA abundance for each gene have been made 
by SAGE (27) and by hybridization of cRNA to oligonucleo- 
tide arrays (30). These two methods give broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table J lists ihe number of molecules of mRNA 
per ceil for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); a second incorpo- 
rates data from both SAGE and hybridization (30) (adjusted 
mRNA) (Table J: Materials and Methods). We coneJated 
protein abundance with mRNA abundance (Fig. 2). "For ad- 
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justed mRNA versus protein, the Spearman rank correlation 
coefficient,.^, was 0.74 (P < 0.0001), and the Pearson corre- 
lation coefficient, r p , on log transformed data (Materials and 
Methods) was 0.76 (P < 0.00001). We obtained similar corre- 
lations for mRNA versus protein and also for other data trans- 
formations (Materials and Methods). Thus, several statistical 
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation is far from perfect; for mRNAs of a given abundance, 
there is at least a 1 0-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due to posttranscriplional 
regulation, and some is due to errors in the mRNA or protein 
data. For example, the protein Yef3 runs poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the low protein/mRNA 
ratio of Yef3. It is the most extreme outlier in Fig. 2. 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
ceil and about 205,000 molecules of protein. Assuming an 
mRNA half-life of 30 min (12) and a cell doubling time of 120 
min, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 times. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be 
the mRNAs that are translated best. 

A full complement of ceil protein is synthesized in about 120 
min under these conditions. Thus, 4,000 molecules of protein 
per molecule of mRNA implies that translation initiates on an 
mRNA about once every 2 s. This is a remarkably.high rate; it 
implies that if an average mRNA bears 10 ribosomes engaged 
in translation, then each ribosome completes translation in 
20 s; if an average protein has 450 residues; this in turn implies 
translation of over 20 amino acids per s, a rale considerably 
higher than estimated for mammalians (3 to 8 amino acids per 
s) (18). These estimates depend on the amount of mRNA per 
cell (11, 27). 

The large number of protein molecules that can be made 
from a single mRNA raises the issue of how abundance is 
controlled for less abundant proteins. Many nonabundanl pro- 
teins may be unstable, and this would reduce Ihe protein/ 
mRNA ratio. In addition, many nonabundaiit proteins may be 
translated at suboptimal rates. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translational initiation. For example, there are over 600 yeast 
genes which probably have short open reading frames in the 
mRNA upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias with protein abundauce. The 
mRNAs for highly expressed proteins prefcreniiaJiy use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may make translation faster or more 
efficient and may decrease misincorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic— highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bias with abun- 
dance continues for medium-abundance proteins. There are 
various mathematical expressions quantifying codon bias; here, 
we have used the CAJ (24) (Materials and Methods) because 
it gives a result between 0 and J. The r, for CAJ versus protein 
abundance is 0.80 (P < 0.0001), similar lo the mRNA-protcin 



TABLE J. Quantitative data" 



Function 



Name 



roRNA 



Carbohydrate metabolism 



Adjusted mRNA 



Protein synthesis 



Heat shock 



Amino acid synthesis 



Miscellaneous 



Adhl 


0.830 


197 


197 




0.504 


0 


Cit2 


0.185 


1 


2.8 


Eriol 


0.870 


No Ma 




Eno2 


0.892 


248 


248 


Fbal 


0.868 


179 


179 


Hxkl? 


0.500 


13 


10.5 


Jell 


0.251 


0 




Pdbl 


0.342 


5 


5 


Pdcl 


0.903 


226 


226 


Pfkl 


0.465 


5 


5 


Pgil 


0.681 


14 


14 


Pycl 


0.260 


1 


0.7 


Ta!J 


0.579 


5 . 


5 


Tdh2 


0.904 


63 


63 


Tdh3 


0.924 


460 


460 


Tpil 


0.817 


NoNla 





Efbl 


0.762 


33 


16.5 


Eftl,2 


0.801 


26 


26 


Prll 


0.303 


4 


0.7 


RpaO 


0.793 


246 


246 


Tifl,2 


0.752 


' 29 


29 


YeD 


0.777 


36 


36 


Hsc82 


0.581 


2 


2.9 


Hsp60 


0.381 


9 


2.3 


Hsp82 


0.517 


2 


1.3 


Hspl04 


0.304 


7 


7 


Kar2 


0.439 


5 ■ 


10.1 


Ssal 


0.709 


2 


43 


Ssa2 


' 0.802, 


10 


5 


Ssbl,2 


0.850 


50 


50 


Sscl 


0-521 


2 


2.6 


Ssel 


0.521 


8 


8 


Stil 


0.247 


1 


1.1 


Adel 


0.229 


4 


4 


Ade3 


0.276 


2 


1.7 


Ade5,7 


0.257 


2 


1.4 


Arg4 


0.229 


- 1 


8.1 


Gdhl 


0.585 


10 


27 


GInl 


0.524 


11 


Jl 


His4 


0.267 


3 


3 


llv5 


0.801 


6 


6 


Lys9 


0.332 


4 


4 


Met6 


0.657 


NoiVfa 


22 


Pro2 


0.248 


3 


3 


Serl 


0.258 


2 


1.2 


Trp5 


0319 


5 


5 



Act I 
Adkl 
AId6 
Atp2 
Bmhl 
Bmh2 
Cdc48 
Cdc60 
: Erg20 
Gpp] 
Gspl 
Jppl 
Lcbl 
Moll 
P;»bl 
Psal 
Rnr4 
Saml 
Sam2 
Sod] 
Ubal 
YKL056 
YLR109 
YMR116 



0.710 

0.531 

0.520 

0.424 

0.322 

0.384 

0.306 

0.299 

0.373 

0.603 

0.621 

0.620 

0.173 

0.4 23 

0.488 

0.600 

0.497 

0.494 

0.497 

0.376 

0.212 

0.731 

0.549 

0.777 



54 
No Ma 
3 
I 

46 

1 

2 

2 

5 
16 

3 

4 

0.3 

0 

3 
15 

6 

5 

3 
36 

2 
62 
21 
41 



54 
3 

4.1 
46 
1.4 
2.4 
0.86 
5 
5 
3 
4 

0.8 
■ 0.45 
3 

15. 

6 

5 
15 
36 

2 

"62 
21 
41 





rioicin \cX\\) 


cJG raho 


1,230 


972 


0.79 


0 


963 . ■ 


>20 


23 - 


288 


12 


410 


974 . 


2.4 


650 




0.33 


640 


OxJo 


0.95 


62 


46 




o 


ft7t- 
Oil 


->zU 


41 


Jj 




280 


205 


0.73 


75 


53 


0.71 


JOU 


120 


0.75 


37 


34 




110 


35 




430 


876 


NR 


1,670 


' 1,927 


NR 


No Met 


No Met 




•*^» 


362 




GO 
77 


D4 


0.55 


19 
\L 


6 


277 


100 


036 


233 


106 


0.46 


14 


ND 




1 J2 


75 


0.67 


35 


82 


2.3 


52 


135 


2.6 


70 


161 


23 


43 


102 


2.4 


303 


421 


1.4 


213 


324 


1.5 


270 


85 


68 


80 


1.2 


96 


48 




25 


44 


1.7 


14 


27 




12 


9 




14 


4 




41 . 


41 




148 


55 




77 


104 


1.3 


15 


23 


1.5 


-.152 


109 


0.7 


32 


17 


0.52 


190 


80 


0.42 


30 


12 




15 


8 




28 


12 





205 
47 
181 
76 
191 
134 
32 
6 
92 
234 
115 

254 
19 
20 
41 

148 
44 
59 
63 

63 J 
14 

253 

930 

184 



164 
43 
159 
109 
137 
147 
26 
2 
39 
153 
39 
147 
40 
16 
19 
56 
37 
21 
20 
618 
20 
112 

40 



0.78 



1.4 

0.72 



0.34 
0.58 



047 



0.44 
0.20 



- CAK a measure of codon bias, is taken horn the YPD. mRNA. numbn of mRNA molecules pei cell Iron. SAGE data (27V adjusted mRNA number of mRNA 
Y^r? , f fhT k.. 110(1 CHip h > blidiz3,i0 " <*» MaictbU and Methods); Protein (Ghr), numbe 'nfU*cu^ 

no^ n I ; 7 ( ' "7 ° l m °' eC " ,eS °' pU>tC,n PC ' CC " YNB C,h:Jn0 '- E ' G Ia,k >« ralio *' P T ° ,C '" abundance in e.hanol to -Jucose The E^G r«io b 
not g tven rf was close to J or ,f „ was no, , tfpca i aW e (NR) in muhiptc se | s . Some gtnc products (e.g., Vrfl and Til2 | Tin,2|) were Jifficufi toTs.inRuish on el« 

ibe 1 n ™ f ' hCSe P °? kd '. N0 /V '" 7 ' h " e "™ *> »" - '»« y of the gene, a J so ,hcVe are no SAGE mt^m- No 

the m.iture gene product contains no meth.onmc*. ;,nd m> rheic are no reliable protein data. ' 
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TABLE 2. Functions of proteins listed in TabJe 1 



Name* 



YPD title lines* 



Adh] 

Adh2 

Cit2 

EnoT 

Eno2 

Fbal 

Hxkl 

Hxk2 

IcIJ 

Pdbl 

PdcJ 

Pfkl 

Pi>il 

Pyd 

Tall 

Tdh2 . 

Tdh3 

TpiJ 

Efbl 
Eftl 
Efi2 
Prtl 

RpaO (RPPO) 

Tifl 

Tif2 

Yef3 

Hsc82 
Hsp60 
Usp32 
Hsp](W 

Kar2 

Ssal 
Ssa2 
Ssbl 
■' S$b2 
Ssc] 

Ssel 
Slil 

-Ade! 

Ade3 

Ade5,7 

Ar«4 

Gdhl 

Gtnl 

His4 . 

Uv5 

Lys9 
. Mci6 

Pro2 
ScrJ 
Trp5 

Actl 

Adfcl 

Ald6 

Alp2 

Bin hi 

Bmh2 

Cdc48 

Cdc60 

Eri-20 

CppJ (Rhr2) 
Gspl 
IppJ 
Lcbl 

Moll (Thi4) 
Pabl 

Psal 
Rnr4 
Sam I 
Sam2 
SodI 
Uba] 



Alcohol dehydrogenase I; cytoplasmic isozyme reducing acetaldehyde 10 -ethanol, regenerating 
Alcohol dehydrogenase II; oxidizes ethanol to acetaldehyde, pJucose repressed ** 

Pltrnt*. cirnlkiK,* 1 / :» I I-" I\_ J ^ _ " . ^ . T 



NAD* 



lycolysis 

Fructose bisphosphaie'a'idolase II; sixth step in^gryVorysis^ * ~ £,, ' J "'' ,0 "* iU F»wp»^''y'pyruvdie in glycolysis 
Hexokinase I; converts hexoses to hexose phosphates in glycolysis; repressed by glucose 

Hexokinase II; converts hexoses to hexose phosphates in glycolysis and plays a regulatory role in glucose repression 
Isoolrale lyase, peroxisomal; carries out part of the glyoxylate cycle; required for gluconeoeenesis 
Pyruvate dehydrogenase complex, EI beta subunit 
Pyruvate decarboxylase isozyme 1 

Phospbofructokinase alpha subunii, part of a complex with Pfk2p which carries out a key regulatory step in prycorysis 
Giucose-6-phosphate isomerase, converts gIucose-6-phosphate to f rue lose- 6-phosphate / k &v / 

Pyruvate carboxylase 1; converts pyruvate to oxaloacetate for gluconeogenesis 
TransaJdoIase; component of nonoxidalive part of pentose phosphate pathway 

GJyceraldehyde-3-phosphale dehydrogenase 2; converts r>glycer aldehyde 3-phosphale to J^dephosphoglycerale 
Gryceraldehyde-3-phosphate dehydrogenase 3; converts r>grycer aldehyde 3-phosphate to Ip-dephosphofirycerate 
Triphosphate isomerase; inlerconverts gJyceraldehyde-3-phosphale and dihydroxyace tone phosphate 

Translation elongation factor EF-3p; GDP/GTP exchange factor for Teflp/Tef2p 

Translation elongation factor EF-2; contains diphihamide which is not essential for activity- identical to Eft2p 
Trans ation elongation factor EF-2; contains diphthamide which is not essential for activity; identical to EftlD 
Translation initiation factor eIF3 beta subunii (p90); has an RNA recognition domain 
Acidic ribosomal protein AO 

Translation initiation factor 4A (eIF4A) of the DEAD box family 
Translation initiation factor 4 A (eIF4A) of the DEAD box family 
Translation elongation factor EF-3A; member of ATP-binding cassette soperl'amily 

Chaperonin homologous to E coli HtpG and mammalian HSP90 

Mitochondrial chaperonin that cooperates with HspJOp; homolog of E coli GroEL 

Heat-inducible chaperonin homologous to E coli HtpG and mammalian HSP90 

H ^^]%h^c!^T^ 0t indUCed ,hern,0,0,erancc and for '"olubilizing aggregates of denatured proteins; important for |psT> 

H Tnd1o^nK ret)UirCd *» 0t ™ — «* -ticulum membrane 

Cytoplasmic chaperone; heat shock protein of the HSP70 family 

Cytoplasmic chaperone; member of the HSP70 family 

Heal shock protein of HSP70 family involved in the translational apparatus 

Heat shock protein of HSP70 family, cytoplasmic 

M JSM&?2S! JiMEi? jiMStf * ^ ^ P ' ay5 ' Ch0p " Onin '° ,e in "«>™* ™» foWi "8 °f P-ein chain, 

^S^SZKi^^ ,he seven,h 5,ep " * novo pu ' irie bi ^ n,h " is p—» 

Phospbonbosylamme-glycme ligase plus phosp'boribosylformvlglvcinamidine evelo- ligase; Afunctional protein • 

Argmmosuccinate lyase; catalyzes the final step in arginine biosynthesis * 

Olulamatc dehydrogenase (NADP*); combines ammonia and a-ketogiutnmle to foim lilulamate 

(jiutnmine synthetase; combines ammonia to ghitamale in ATP-driven reaction 

m?b£^ pyrophosphohydrolase/his.idinol dehydrogenase; 2nd, 3rd, and 10th steps of 

K TnVlK d bi^ 3dd ^oiwmww) (alpha-keto-P-hydroxylacyl) reducloisomerase); second step in Val 

ItoS forming) (saccbaropine reductase), .seventh step in lysine biosynthesis pathway 

^SlSn i^X^ ' (5-meibyllelrahydroplcF.wl tnglutamatc-homocysteine mc.hyltransfcrase), methionine synthase, ' 
j-Glutamyl phosphate reductase (phosphoglulamate dehydrogenase), proline biosynthetic enzyme 
JKnospnoscrme transamrnase; involved in synthesis of serine from 3-phosphoglycerate 
Tryptophan synthase, last (5th) step in tryptophan biosynthesis pathway 

Aclin; involved in cell polarization, endocyiosis, and other cvroskeletal functions 
Adenylate kinase (CTP:AMP phosphotransferase), cytoplasmic 
Cytosohc acetaldehyde dehydrogenase 

Beta subunii of Fl - ATP synthase; 3 copies are found in each Fl oligomer 
Homolog of mammalian 14-3-3 protein; has strong similarity to Bmh2p 
Homolog of mammalian 14-3-3 protein; has strong similarity to BmhJp 

i T.pk!^ ^ fnmil ' V ° f ATPase5 . : "quired for cell division and homotvpic membrane fusion 
LeucyMRNA synthetase, cytoplasmic 

Famesyl pyrophosphate synthetase; may be rate-limiting siep in sterol biosynthesis pathway 
DL- Glycerol phosphate phosphatase ■ 
Ran, a CTP-binding protein of the Ras superfnmily involved in trafficking throneh nuclear pores 
Inorganic pyrophosphatase, cytoplasmic ^ 

Component of serine Cpalmitoyhransferase; first step in biosynthesis of long-chain base componen. of sphinoolipids 
Th lamine-repressed pi ote,n essential for growth in the absence of thiamine spnmgonpias 

.^S"LK?JS" ^ Cy,(>P,:,Sm a,>d nUdCnS; P;,M ° f ,hC 3 '' Cnd RNA-pioccsrfng comple, (cleavage factor I); has 4 RNA 
Mannose- J -phosphate puanyliiansferasc; GDP mannose pyrophosphon lase 
Ribonucleotide reductase small subunit 
5-Adenosylmelhionine synthetase 1 
S- Ade nosy) methionine synthetase 2 
Copper -zinc superoxide dismulase 
Ubiq ui tin -activating (El) enzyme 



Yt K R?NQ ( Ah.n drT. m K b, r ,r3n ^V° nat ;. v controlled tumor p.otein of animal cells and higher plants " 

YLKW9 (Ahpn Alkyl hydroperoxide icduciase e P 

YMRI16 (Ascl) Abundant protein with effects on t.anslational effic iency and cell size, has two WD (WD-40) repeats 

• Acccp(eJ name irom .he Surcharomyccs genome diiahase and YFD. Names in pa.eniheses represent recent ch.ntcs 
Courtesy of Prorciln»e ; Inc.. icpnnicU wiih permission. b 
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Adj. mRNA 

FIG. 2. Correlation of protein abundance with adjusted mRNA abundance. 
The number of molecules per cell of each protein is plotted against the number 
of molecules per cell of the cognate mRNA, with an r p of .0776. Note the 
logarithmic axes. Data for mRNA were taken from references 27 and 30 and 
combined as described in Materials and Methods. 



correlation, confirming a strong correlation between CA1 and 
protein abundance (Fig. 3). The relationship between CAI and 
protein abundance is log linear from about 1,000,000 to about 
10,000 molecules per cell. We have no data for rarer proteins. 

It is not clear whether CAI reflects. maximum or average 
levels of protein expression. The proteins used for the CAJ- 
protein. correlation included some proteins which were not 
expressed at maximum levels under the condition of the ex- 
periment (Hsc82, Hspl04, Ssal, Adel, Arg4, His4, and others). 
When these proteins were removed from consideration and 
the correlation between CAJ and the remaining (presumably 
constitutive) proteinswas recalculated, the r s was essentially 
unchanged (not shown). 

The equation describing the graph in Fig. 3 is log (protein 
moJecuJes/cell) - (2.3 X CAJ) + 3.7. Thus, under certain 
conditions (a CA) of 0.3 or greater: a consiitutivejy expressed 
gene), a very rough estimate of protein abundance can be 
made by raising 10 to the power of [(23 X CAI) + 3.7]. 

The distribution of CAJ over the genome (Fig. 4) consists of 
a lower, bell-shaped distribution, possibly indicating a region 
where there is.no selection for codon bias, and an upper, flat 
distribution., starting at a CAJ of about 0.3, possibly indicating 
a region where there is selection for codon bias. Almost all of 
the proteins whose abundance we have measured are in the 
upper, flat portion of the distribution. In the lower, beJJ-shaped 
region, we do not know whether there is a correlation between 
CAJ and protein abundance. 

Changes in protein abundance in glucose and ethaool. A 
comparison of cells grown in glucose (Fig. 1 A) with cells grown 
in ethanol (Fig. IB) is shown in Table J. As is well known, 
some proteins are induced- tremendously during growth on 
ethanol. Two striking examples are the peroxisomal enzymes 
Jell (isocitraie lyase) and Cit2 (citrate synthase), which are 
induced in ethanol by" more than 300- and 12-fold., respectively 
(Fig. 1; Table 1). These enzymes are key components of the 
glyoxylate shunt, which diverts some acetyl coenzyme A 
(aceryJ-CoA) from the tricarboxylic acid cycle to gluconeogen- 
esis. S. cerevisiae requires large amounts of carbohydrate for its 
cell wall; in ethanol medium, this carbohydrate comes from 
gluconeogenesis, which depends on the glyoxylate shunt and 
on the glycolytic pathway running in reverse. The need for 



gluconeogenesis also explains why glycolytic enzymes are 
abundant even in ethanol medium. Thus, 2D gel analysis shows 
the prominence of the glycolytic and glyoxylate shunt enzymes 
. in cells grown on ethanol, emphasizing that gluconeogenesis, 
presumably largely for production of the cell wall, is a major 
metabolic activity under these conditions. 

During gluconeogenesis, substrate-product relationships are . 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse 
reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol 
dehydrogenase 2) is strongly induced (16), while its isozyme 
Adhl is not greatly ailected. Adhl and Adh2 each interconvert 
acetaldehyde and ethanol. Adhl has a relatively high K m for 
ethanol (17 mM), while Adh2 has a lower 7C m (0.8 mM)(5). 
Thus, it is thought that Adhl is specialized for glycolysis (ac- 
etaldehyde to ethanol), while Adh2 is specialized for respira- 
tion (ethanol to acetaldehyde) (5, 29). Similarly, Enol (enolase 
1) is induced in ethanol, while its isozyme Eno2 (enolase 2) 
decreases in abundance (Table 1) (4, 19). Enol is inhibited by . 
. 2-phosphogfycerate (the glycolytic substrate), while Eno2 is 
inhibited by phosphoenolpyruvate (the gluconeogenic sub- 
strate) (4). Perhaps Enol has a lower K m for phosphoenol- 
pyruvate than does Eno2, though to our knowledge this has not 
been tested. Thus, the 2D gels distinguish isozymes specialized 
for growth on glucose (Adhl and Eno2) from isozymes spe- 
cialized for ethanol (Adh2 and Enol). 

Many heat shock proteins (e.g., Hsp60, Hsp82, Hspl04, and 
Kar2) were about twofold more abundant in ethanol medium 
than in glucose medium. This is consistent with the increased 
beat resistance of cells grown in ethanol (3). 

Enzymes involved in protein synthesis (Ef 11, RpaO, and Tifl ) 
were about twice as abundant in glucose medium as in ethanol 
medium. This may reflect the higher growth rate of the cells in 
glucose; 

Phosphorylation of proteins. To examine protein phosphor- 
ylation, we labeled cells with 32 P and ran 2D gels to examine 
phosphoproteins. About 300 distinct spots, probably represent- 
ing 150 to 200 proteins, could be seen on pH 4-8 gels (Fig. 5B). 
We then aligned autoradiograms of three gels, each with a . 
different kind of labeled protein ( 32 P only |Fig. 5B], 32 P plus 
35 S (Fig. 5A), and 35 S only [not shown, but see Fig. 1 for 
example]). Jn this way, we made provisional identification of 
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FIG. 3. Correlation o/ protein abundance with CAI. The number of mole- 
cules per cell of each protein is plotted against the CAJ for thai pioicin. Note the 
logarithmic scale on the protein axis. Data tot the CAI are fiom the YPD 
databa.se (13). 
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FIG. 4. Distribution of CA1 ovei the whole genome, shown in internals of 0.030 (i.e., (here axe 150 genes with a CAI between 0.000 and 0.030, inclusive; 31 genes 
with a CAI between 0.031 and 0.060; 269 genes with a CAI between 0.061 and 0.090; 1,296 genes with a CAI between 0.091 and 0.120; etc.) The distribution peaks 
with 2,028 genes with a CAI between 0.121 and 0.150. 



some of the 32 P-labeled spots as particular 35 S-Iabeled spots. 
AJ1 such identifications are somewhat uncertain, since precise 
alignments are difficult, and of course multiple spots may ex- 
actly comigraie. Nevertheless, we believe that most of the 
provisional identifications are probably correct. Among the 
major 32 P-labeled proteins are the hexokinases Hxkl and 
Hxk2, the acidic ribosome-associated protein RpaO, the trans- 
lation factors Yef3 and Efbl, and probably Hsp70 heat shock 
proteins of the Ssa and Ssb families. RpaO and Efbl are quan- 
titatively monophosphorylated. 

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has live or more spots, at least four of which 
comigraie with 32 P. Tpil has a major spot showing no 32 P 
labeling and a minor, more acidic spot which overlaps with 
some 32 P label. Tif] has at least seven spots (7); two of these 
overlap with some 32 P label, but five do not (Fig. 5). Eftl has 
at least three spots (7), and none of these overlap with 32 P, 
although there are three nearby, unidentified 32 P- labeled spots 
(a, c, and d in Fig. 5). Spots that seem to be extra forms of 
Met 6, Pdcl, Eno2, and Fbal can be seen in Fig. 6A, but there 
is little 32 P at these positions in Fig. 5. -Thus, phosphorylation 
explains some but not all of the different protein isoforms seen. 

The cell cycle is regulated in part by phosphorylation. We 
compared 32 P-labeled proteins from cells synchronized in G, 
with a -factor, in cells synchronized in by depletion of G } 
cydins, and in cells .synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were difficult 
to reproduce. The cell cycle proteins regulated by phosphory- 
lation may not be abundant enough for this technique to be 
applied easily. 

Centrifugal fractionation. We fractionated 3S S- labeled ex- 
tracts by centrifugation (Materials and Methods). Figure 6A 
shows the proteins in the supernatant of a high-speed 
(100,000 X g t 30 min) centrifugation, while Fig. 613 shows the 
proteins in the pellet of a low-speed (16:000 x g, 10 min) 
centrifugation. Many proteins are tremendously enriched in 
one fraction or the other, while others ate present in both. 



Most glycolytic enzymes (e.g., Tdh2, Tdh3, Eno2, Pdcl, Adhl, 
and Fbal) are enriched in the supernatant fraction. The only 
exception is Pfkl (not indicated), which is found in both pellet 
and supernatant fractions. Many proteins involved in protein 
synthesis (Eftl, Yef3, Prtl, Tif 3, and RpaO) are in the pellet, 
possibly because of the association of ribosomes with the en- 
doplasmic reticulum. However, Efbl is in the supernatant, as is 
a substantial portion of the Eftl. Perhaps surprisingly, several 
mitochondrial proteins (Atp2 [not shown) and Uv5) are largely 
in the supernatant. Perhaps glass bead breakage of cells re- 
leases mitochondrial proteins. The nuclear protein Gspl is in 
the pellet fraction. The enrichment produced by centrifugation 
makes it possible to see minor spots which are otherwise poorly 
resolved from surrounding proteins. Figure 6B shows that the 
previously identified Tifl spot is surrounded by as many as six 
other spots that cofractionale. We observed six identical or 
very similar additional spois when we over expressed Tifl from 
a high-copy-number pJasmid (not shown). Signal overlaps only 
one or two of these spois in 32 P-labeJing experiments (Fig. 5), 
and so the different forms are not mainly due to different 
phosphorylation states. 

D1SCUSSJON 

Our experience with developing a 2D gel protein database 
for S. cetevisiae is summarized here. With current technology, 
we can see the most abundant 1,200 proteins, which is about 
one-third to one-quarter of the proteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods that we have used, not because of a lack of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the 1,200 proteins seen, we have identified 148, with a 
bias toward the most abundant proteins. Steady application of 
the methods already used would allow identification of most of 
the remaining proteins. Gene overexpression will be particu- 
larly useful, since it is not affected by the lower abundance of 
the remaining visible proteins. 
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2D gels of the kind that we have used are not suitable for 
visualization of rare proteins. However it wij) be possible to 
study on a global basis metabolic processes involving relatively 
abundant proteins, such as protein synthesis, glycolysis, gh> 
coneogenesis, amino acid synthesis, cell wall syn thesis, nucle- 
otide synthesis, lipid metabolism, and the heat shock response. 

Gygi et a I. (10) have recently completed a study similar to 
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
mRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi et al. feel that mRNA 
abundance is a poor predictor of protein abundance and that 
"codon bias is not a predictor of either protein or mRNA 
levels" (10). These different conclusions are partly a matter of 
viewpoint. Gygi et al. focus on the fact that the correlations of 
mRNA and codon bias with protein abundance are far from 
perfect, while we focus on the fact that, considering the wide 
range of mRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly- due to 
different methods of statistical analysis and to real differences 
in data. With respect to statistics, Gygi et al. used the Pearson 
product-moment correlation coefficient (r p ) to measure the 
covariance of mRNA and protein abundance. Depending on 
the subset of data included, their r values ranged from 0.1 to 
0.94. Because of the low r values with some subsets of the 
data, Gygi et al. concluded that the correlation of mRNA to 
protein was poor. However, the r p correlation is a parametric 
statistic and so requires variates following a bivariate normal 
distribution; that is, it would be valid only if both mRNA and 
protein abundances were normally distributed. Iri fact, both 
distributions are very far from normal (data not shown), and so 
a calculation of r p is inappropriate. There was no statistical 
backing for the assertion that codon bias fails to predict pro- 
tein abundance. 

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (r 5 ). Since this 
statistic is nonparametric. there is no requirement for the data 
to be normally distributed. Using the r st we find that mRNA 
abundance is well correlated with protein abundance {r s = 
0.74), and the CA1 is also well correlated with protein abun- 
dance (r s = 0.80) (and also with mRNA abundance [data not 
shown]). For the data of Gygi et al. (10), we obtained similar 
results, though with their data the correlation is not as good; r 5 
~ .0.59 for the mRNA- to- protein correlation, and r 3 = 0.59 for 
the codon bias- to- protein correlation. 

In a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r p (Materials and Methods). Two trans- 
formations, Box-Cox and logarithmic, were used; both gave 
good correlations with our data [e.g.. r = 0.76 for lbg(adjusted 
RNA) to log(protein)]. Wc were not able to transform the data 
of Gygi et al. to a normal distribution. 

Finally, there are also some differences in data between the 
two studies. These may be partly due to (be different measure- 
ment techniques used; Gygi et al. measured protein abundance 
by cutting spots out of gels and measuring the radioactivity in 
each spot by scintillation counting, whereas we used phospho- 
rimaging of intact gels coupled to image analysis. We com- 
pared our data to theirs for the proteins common between the 
studies (but excluding proteins whose mRNAs are known to 
differ between rich and minim;)! media, and excluding Tifl, 
which was anomalous in differing by 100- fold between the two 
data sets). The r s be I ween ihe two protein data sets was 0.88 
(P < 0.0001). Although this is a strong correlation, the fact that 
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it is less than 1.0 suggests that there may have been errors in 
measuring protein abundance in one or both studies. After 
normalizing the two data sets to assume the same amount of 
protein per cell, we found a systematic tendency for the protein 
abundance data of Gygi et al. to be slightly higher than ours for 
the highest- abundance proteins and also for the lowest-abun- 
dance proteins but slightly lower than ours for the middle- 
abundance proteins. These systematic differences suggest some 
systematic errors in protein measurement. Although we do not 
know what the errors are, we suggest the following as a rea^ 
sonable speculation. For the highest-abundance proteins, we 
may have underestimated the amount of protein because of a 
slight ry nonlinear response of the phosphorimager screens. For 
the lowest-abundance proteins, Gygi et al. may have overesti- 
mated the amount of protein because of difficulties in accu- 
rately ait ling very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi et al. 
have a poor correlation with mRNA abundance. We calculate 
that the r, is 0.74 for the top 54 proteins of Gygi et al. but only 
0.22 for the bottom 53 proteins, a statistically significant dif- 
ference. However, with our data set, the r 5 is 0.62 for the top 33 
proteins and 0.56 (not significantly different) for the bottom 33 
proteins (which are comparable in abundance to the bottom 
53 proteins of Gygi et al.). Thus, our data set maintains a good 
correlation between mRNA and protein abundance even at 
low protein abundance. This is consistent with our speculation 
that protein quantification by phosphorimaging and image 
analysis may be more accurate for small, weak spots than is 
cutting out spots followed by scintillation counting. Our rela- 
tively good correlations even for nonabundant proteins may 
also reflect the fact that we used both SAGE data and RNA 
hybridization data, which is most helpful for Ihe least abundant 
mRNAs. In summary, we feel that the poor correlation of 
protein to mRNA for the nonabundant proteins of Gygi et al. 
may reflect difficulty in accurately measuring these nonabun- 
dant proteins and mRNAs. rather than indicating a truly poor 
correlation in vivo. It is not surprising that observed correla- 
tions would be poorer with less-abundant proteins and 
mRNAs, simply because the accuracy of measurement would 
be worse. 

How well can mRNA abundance predict protein abun- 
dance? With r p - 0.76 for logarithmically transformed mRNA 
and protein data, the coefficient of determination, (r p ) 2 , is 0.58. 
This means that more than half (in log space) of the variation 
in protein abundance is explained by variation in mRNA abun- 
dance. When converted back to arithmetic values, protein 
abundances vary over about 200-fold (Table 1), and \r p ) 2 ~ 
0.58 for the log data means that of this 200- fold variation, 
about 20-fold is explained by variation in the abundance of 
mRNA and about 30- fold is unexplained (but could be due 
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between mRNA and protein abundance will be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. 

Some important conclusions can be drawn from this sam- 
pling of ihe proteome. First, there is an enormous range of 
protein abundance, from nearly 2,000,000 molecules per cell 
for some glycolytic enzymes to about 3 00 per cell for some cell 
cycle proteins (26a). Second, about half of all cellular proicin 
is found in fewer than 100 different gene products, which are 
mostly involve tl in cai bony d rale metabolism or protein syni he- 
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sis. Third, the correlation between protein abundance and CAI 
is log linear as far as we can see, which is from about 10,000 
protein molecules per cell to about 1,000,000. This is somewhat 
surprising, because it implies that selective forces for codon 
bias are significant even at moderate expression levels. Jt also 
means that codon bias is a useful predictor of protein abun- 
dance even for moderately low bias proteins. Fourth, there is a 
good correlation between protein abundance and mRNA 
abundance for the proteins that we have studied. This validates 
the use of mRNA abundance as a rough predictor of protein 
abundance, at least for relatively abundant proteins. Fifth, for 
these abundant proteins, there are about 4,000 molecules of 
protein for each molecule of mRNA. This Jast conclusion 
raises questions as to how the levels of nonabundarit proteins 
are regulated and suggests that protein instability, regulated 
translation, suboptimal rates of translation, and other mecha- 
nisms in addition to transcriptional control may be very impor- 
tant for these proteins. 
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Pre-translational regulation of cytochrome P450 genes is responsible for 
SrtoS ° e$ ° f iDdivid,,al P45 ° enz * mes amo *g Patients with 

GegrggJ, LiddleC, Murray M , BythK, Farrell GC . 

^ZT dy K^ rtCd that u d ; seas ^Pecific differential alterations in the hepatic 
expression of xenobiohc-metabolizing cytochrome P450 (CYP P450) enzymes occur in 

in CYP pro ems are modulated at pre- or pdst-translatiorial levels, we have now examSd 
the hepatic levels of mRNA for CYPs IA2, 2C9, 2E1 and 3A4 by solution ^ybriSon 
m the same hvers of 20 controls (surgical waste from WstologicaUy nonnal 1^32 

SSa [ : lai "* 18 ° f Ch0,CStatiC S6Vere ^ H - r ^ CYP1 A2 
mRNA and CYP 1 A immunoreactive protein were both reduced in livers with 

hepatocellular and cholestatic types of cirrhosis. In contrast, CYP3A4 mRNA and protein 

were reduced only m hvers from patients with hepatocellular diseases. For 1 A2 anH/S 

there were sigmficant correlations between mRNA species and the respective protein 

ITT ( 1t£\:Vt' rS3A4 = a64 ' p < 0000, >- ™ 2C9 mRN "wS 7Z^Z 

pauents wito bom cholestatic and hepatocellular types of liver disease, but 2C protein 

CYV^PMA y Ch ° leStatiC *y*^™- ™e correlation between 

UYP2C9 mRNA and protein, was also significant (rs = 0.36, P < 0.005) but mRNA levels 
accounted for only 13% of the variability in protein rankings. This is probabl^T 

3™ST eins apart from 209 being detected b * the - tj - 2C 

antibody CYP2E1 mRNA and protein were reduced in patients with cholestatic liver . 
disease, but in hepatocellular disease the expression of only CYP2E1 mRNA was 
decreased. CYP2E1 mRNA was significantly correlated with CYP2E1 protein but 
accounted for ordy 18% of the variability in protein rankings (rs = 0.43, P < 0;0005) 

SSlS? J PVP ^ ^ indiC3te tHe disease - s P««fic alterations of xenobiotic- 
metabohzing CYP enzymes among patients with cirrhosis is due, at least in part, to pre- 
translanonal mechamsms. The lack of a strong correlation between CYP2E I mRNA and 
protein suggests that this gene, like its rat orthologue, may be subject to pre-transtational 
as well as translational and/or post-translational regulation: nmsianonai 

PMID: 7741759 [PubMed - indexed for MEDLINE] 
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Cyclooxygenase-2 cxpressioa in macrophages: modulation by protein 
kinase C-alpUa. 

Girotix M . Descotcaux A . 

Institut National de la Recherche Scientifique-Institut Armand-Frappier, Univers.te du 
Quebec, Laval, Canada. 

Cyclooxygenase-2 (COX-2) is an inducible enzyme responsible for high levels of PG 
production during inflammation and immune responses. Previous studies with 
pharmacological inhibitors suggested a role for protein kinase C (PKC) in PG production 
possibly by regulating COX-2 expression: In this study, we addressed the role of PKC- 
alpha in the modulation of COX-2 expression and PGE2 synthesis by the overexposing 
„ ,«rT' nant " ncgative (DN > mulant of this isoenzyme in the mouse macrophage cell line 
IVq rSr WC ,nvesti S ated *« effe <* of various stimuli on COX-2 expression, namely 
LPS, IFN-gamma, and the intracellular parasite Leishmania donovani. Whereas LPS- 
induced COX-2 raRNA and protein expression were down-regulated in DN PKC-alpha- 
overexpressing clones, IFN-gamma-induced COX-2 expression was up-regulated in DN 
PKC-alpha-overexpressing clones with respect to normal RAW 264 7 cells 
Measurements of PGE2 levels revealed a strong correlation between PGE2 secretion and 
.IFN-gamma-induced COX-2 mRNA and protein levels in DN PKC-alpha-overexpressing 
clones. Taken together, these results suggest a role for PKC-alpha in the modulation of 
LPS- and IFN-gamma-induced COX-2 expression, as well as in IFN-gamma-induced . 
PGE2 secretion. ^ 
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Overexpression of a DEAD box protein (DDX1) in neuroblastoma and 
retinoblastoma cell lines. 

Godbout R . Packer M . Bie W . 

Department of Oncology, Cross Cancer Institute and University of Alberta, 1 1560 
University Ave., Edmonton, Alberta T6G 1 Z2, Canada. 

The DEAD box gene, DDX1, is a putative RNA Kelicase that is co-amplified with 
MYCN in a subset of retinoblastoma (RB) and neuroblastoma (NB) rumors and cell lines. 
Although gene amplification usually involves hundreds to thousands of kilobase. pairs of 
DNA, a number of studies suggest that co-amplified genes are only overexpressed if they 
provide a selective advantage to the cells in which they are amplified. Here, we further 
characterize DDX1 by identifying its putative transcription and translation initiation sites. 
We analyze DDX1 protein levels in MYCN/DDX1 -amplified NB and RB cell lines using 
polyclonal antibodies specific to DDX1 and show that there is a good correlation with 
DDX1 gene copy number, DDX1 transcript levels, and DDX1 protein levels in all cell 
lines studied. DDX1 protein is found in both the nucleus and cytoplasm of DDX1- 
amplified lines but is localized primarily to the nucleus of nonamplified cells. Our results 
indicate that DDX1 may be involved in either the formation or progression of a subset of 
NB and RB tumors and suggest that DDX1 normally plays a role in the metabolism of 
RNAs located in the nucleus of the cell. 
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Modulation of gap junction mediated intercellular communication in TM3 
Leydig cells. 

Goldenbcrg RC, Fortes FS . Cristancho JM, Morales MM , Fraud CR , Varanda WA 
Campos de Carvalho AC 

Institute of Biophysics Carlos Chagas Filho, UFRJ, Brazil. 

Long-term modulation of intercellular communication via gap junctions was investigated 
in TM3 Leydig cells, under low and high confluence states, and upon treatment of the 
cells for different times with activators of protein kinase A (PKA) and protein kinase C 
(PFCC). Cells in low confluence were readily coupled, as determined by transfer of the 
dye Lucifer Yellow; on reaching confluence, the cells uncoupled. Western blots and RT- 
PCR revealed that connexin 43 (Cx43) was abundantly expressed in TM3 Leydig cells 
and its expression was decreased after the cells achieved confluence. Stimulation of PKA 
or PPCC induced a decrease in cell-cell communication. Staurosporin, an inhibitor of 
protein kinases, increased coupling and was able to prevent and reverse the uncoupling 
actions of dibutyryl cAMP and 12-0-tetradecanoyI-phorboH3-acetate (TP A). Under 
modulation by confluence, Cx43 was localized to the appositional membranes when cells 
were coupled and was mainly in the cytoplasm when they were uncoupled. In addition, 
cAMP and TP A reduced thesurface membrane labeling for Cx43, whereas staurosporin 
increased it. These data show a strong correlation between functional coupling and the 
membrane distribution of Cx43, implying that this connexin has an important role in 
intercellular communication between TM3 cells. Furthermore, increased testosterone 
secretion in response to luteinizing hormone was accompanied by a decrease in 
intercellular communication, suggesting that gap junction mediated coupling may be a 
modulator of hormone secretion in TM3 cells. 
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ABSTRACT 

Motivation: Protein abundance is related to mRNA 
expression through many different cellular processes. 
. Up lo now, there have been conflicting results on how 
correlated the levels of these two quantities are. Given that 
expression and abundance data are significantly more 
complex and noisy than the underlying genomic sequence 
information, it is reasonable to simplify and average them 
in terms of broad proteomic categories and features (e.g. 
functions or secondary structures), for understanding their 
relationship. Furthermore, it will be essential to integrate, 
within a common framework, the results of many varied 
experiments by different investigators. This will allow one 
to survey the characteristics of highly expressed genes 
and proteins. 

Results: To this end, we outline a formalism for merging 
and scaling many different gene expression and protein 
abundance data sets into a comprehensive reference 
set, and we develop an approach for analyzing this in 
terms of broad categories, such as composition, function, 
structure and localization. As the various experiments are 
not always done using the same set of genes, sampling 
bias becomes a central issue, and our formalism is 
designed to explicitly show this and correct for it. We apply 
our formalism to the currently available gene expression 
and protein abundance data tor yeast. Overall, we found 
substantial agreement between gene expression and 
protein abundance, in terms of the enrichment of structural 
and functional categories. This agreement, which was 
considerably greater than the simple correlation between 
these quantities lor individual genes, reflects the way 
broad categories collect many individual measurements 
into simple, robust averages. In particular, we found 

*To u'hon* conespondencc .v fund J be athJio.scd. 
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that in comparison to the population of genes in the 
yeast genome, the cellular populations of transcripts and 
proteins (weighted by their respective abundances, the 
transcriptome and what we dub the Iranslatome) were both 
enriched in: (i) the small amino acids Val, Gly, and Ala; 
(ii) low molecular weight proteins; (iii) helices and sheets 
relative to coils; (iv) cytoplasmic proteins relative to nuclear 
ones; and (v) proteins involved in 'protein synthesis,' 'cell 
structure,' and 'energy production.' 

Supplementary information: http://genecensus.org/ 
. expression/translatome 
Contact: mark.gerstein@yale.edu 

INTRODUCTION 

High throughput experimentation, measuring mRNA 
(Schena et ai, J 995; Eisen and Brown, 1999; Ferea 
and Brown, J 999; Lipshutz et ai, 1999) and protein 
expression (Anderson and Seilhamer, J 997; Futcher et ai, 
J 999; Gygi et ai, 1999a; Ross-Macdonald et ai r J 999; 
Lopez, 2000; MacBeatb and Schreiber, 2000; Nelson 
et ai, 2000; Zbu et ai, 2000) are currently the single 
ricbesi source of genomic information. However, how to 
best interpret this data is still an open question (Basset t 
et ai, J 996; Wines mid Friedman, 1999; Zhang, J 999; 
Gcrsiein and Jansen, 2000; Searls, 2000; Sherlock, 2000; 
Claverie, 1999; Einarson and Golcmis, 2000; Epstein and 
Butow, 2000; Shapiro and Harris, 2000). Understanding 
how protein abundance is related to mRNA transcript 
levels is essential for interpreting gene expression., protein 
interactions, structures and functions in a cellular sys- 
tem (Batzimanikatis et ai. 1999). Moreover, as protein 
concentration is the more relevant variable with respect 
to enzyme activity, ii connects genomics to the physical 
chemistry of the cell (Kidd et ai, 2001). Protein abun- 
dance may also be invaluable for diagnostics and for 
determining drug targets ( Cor t ha Is et ai, 2000). 
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Previously, we surveyed the population of protein 
features — such as folds, amino acid composition, and 
functions — in yeast, and other recently sequenced 
genomes (Gerstein, 1997, 1998a,b; Gerstein and Hegyi, 
J998; Hegyi and Gerstein, 1999; Das and Gerstein, 
2000; Lin and Gerstein, 2000), and we extended this 
concept to compare the population of features in the 
yeast transcriptome to that in the genome (Drawid et 
ai, 2000; Jansen and Gerstein, 2000). Others have also 
done related work (Frishman and Mewes, 1997; Tatusov 
et oL y 1997; Jones, 1998; Wallin and von Heijne, 1998; 
Frishman and Mewes, 1999; Wolf et al y 1999). Here, we 
present a new methodology to compare the features of the 
mRNA expression population with the protein abundance 
population. 

Precise terminology is essential for this comparison. 
Unfortunately, 'proteome' is used inconsistently. Pro- 
teome can logically be used to describe all the distinct 
proteins in the genome (Qi et aL y J 996; CavalcoJi et aL y 
1997; Fey et aL, 1997; Garrets et aL y 1997; Gaasterland, 
1999; Jones, 1999; Sali, 1999; Tekaia et aL, 1999; 
Bairoch, 2000; Cambillau and Claverie, 2000; Dooliitle, 
2000; Pandey and Mann, 2000; Rubin et aL, 2000) and, 
in this context, it is equivalent to what others may refer 
to as the coding part of the genome. However, in papers 
on two-dimensional (2D) electrophoresis, it is often used 
to describe the sum total of proteins in a eel), taking 
into account the different levels of protein abundance 
(Shevchenko et aL, 1996; Gygi et aL, 2000a; Lopez, 
2000; Washburn and Yates, 2000). .In an effort to be clear, 
we propose the term 'translatome' for this second usage 
of proteome. 

With this definition, we are able to refer compactly to 
three different cellular populations. These are illustrated 
in Figure i . 

(i) We use the term genome when we refer to the 
population of open reading frames, where each ORF 
counts once. 

(ii) We use the term transcriptome when we refer to 
i he population of mRNA iranscripis. This term was 
originally coined by Vclculescu et al. (1997). Note 
that each ORF may give rise to different numbers 
of transcripts. Consequently, the transcriptome is 
essentially the same as the genome but with each 
ORF weighted by its expression level. 

fiii) The next level is the cellular population of proteins. 
As each protein represents a translated transcript, 
we make an analogy with the term trans criptome 
and use the term translatome as described above 
to describe this third population. Thus, the irans- 
Intome is a subset of the genome where each 
ORF is weighted by its associated level of protein 
abundance. 



Note that one could also, less compactly call the trans- 
latome a 'weighted proteome/ However, doing so assumes 
one of the two aforementioned definitions of proteome. To 
avoid ambiguity, we studiously avoid the use of proteome 
altogether in the paper. 

Differences between the translatome and the transcrip- 
tome exist given that transcripts from different genes 
can give rise to different numbers of proteins, due to 
different rates of translation and protein degradation. 
Post- transcriptional modifications further affect the 
translatome. 

In our analysis of the transcriptome and translatome, we 
focus on global protein features rather than the compari- 
son of individual genes. Previous analyses have shown that 
differences between mRNA expression and protein abun- 
dance levels can be quite dramatic for individual genes. 
. This may either be due to the noise in the data or to funda- 
mental biological processes. However, our analyses show 
that the variation between transcriptome and translatome 
is much smaller for global properties thai are computed by 
averaging over the properties of many individual genes. 

METHODS 
Data sources used 

For our analysis we culled many divergent data sets, 
representing protein abundance and mRNA expression 
experiments and also other sources of genome annotation. 
These are all summarized in Table 1. 

Biases in the da I a 

The databases that annotate the specific genes may 
not always be accurate (Jsbii et aL y 2000). Gene Chip 
experiments suffer with regard to cross hybridization 
and l he saluralion of probes. SAGE data degrades for 
lowly expressed mRNAs. 2D gels are unable to resolve 
membrane proteins (approximately 30% of the genome) 
and basic proteins (Gerstein, 1998c; Krogh et al. y 2001). 
In addition, the procedures for identification and quan- 
tification of the protein spots are subject to. uncertainties 
(Haynes and Yates, 2000). Human biases include the 
lack of low abundance proteins (Fey and Larsen, 2001; 
Gygi et aL, 2000b; Harry et aL, 2000) and the differences 
beuvcen laboratories in sample preparation. Our reference 
expression data set attempts to resolve these problems. 

Data set scaling 

A reference set for mRNA expression. With many differ- 
ent mRNA expression data sets available, it is worthwhile 
to integrate them into a single unified reference set, with 
the intention of reducing the noise and errors contained in 
the individual data sets and to obtain a unified estimate of 
the normal expression stale in a cell. 
We adopt an iterative scaling and merging formalism. 
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Fig. 1. Schematic overview of the analysis. On the left- side we outline the terms we use to describe ihe process of gene expression. The 
coding section of ihe genome is transcribed into a population of mRNA transcripts called the 'transcriptome/ The transcripts in turn are 
translated to a population of proteins; we use the term 'translator™:* for this protein population rather than the alternative 'proteome* because 
the latter term may be confounded with the protein complement of the genome (which is not necessarily associated with a quantitative 
abundance level). 

The matrix in the middle schematically shows an analysis of the three stages of expression. In general, we define a protein 'population' as 
a set of genes associated with a corresponding number of expression or abundance levels ('weights'); Jn the matrix each row represents a 
weight and each column a gene set. In particular, we differentiate between the mRNA reference expression set (GmRN A = G Cen ), which 
essentially covers the complete genome, and the reference protein abundance set (G Plot ) which contains the proteins in data sets 2-DE#I and 
2-DE Wl (see Table 1) because the protein abundance set is a significantly smaller subset of the genome. By definition, this subset contains 
only proteins that can be identified by 2-D gel electrophoresis and is therefore biased in this sense. The enrichment figures throughout this 
paper, through a comparison of the right- and left-sides of this figure, show the results of the experimental biases of 2D gels on the data set. 
Each pie chart represents a composition of a particular protein feature F (for instance, an amino acid composition) in a population (represented 
by the symbol ii). We can further look at the 'enrichment' of this feature in one population lelalive to another (represented by the symbol A, 
see Section 'Methods' for an explanation of the formalism). 



which we summarize below. We present a more del oiled 
review of the me i hods on our web site. 

We start with the values of one gene chip data set Vi 
where /' is used throughout as a subscript to denote gene 
number. We then transform the values of the next Gene 
Chip data set X t to Y { with the following non-linear regres- 
sion: min£ ( - (Yi - V;) 7 , with Y; = AX? where A and B 
are the parameters of (he regression - . Note that two Gene 
Chip sets may not be defined for the same set of genes, 
so we have to perform the fit only over the genes com- 
mon to both sets. The motivation for scaling is that the 
dynamic range of observed expression levels varies some- 
what between different data sets, although cell types and 
growth condition:; are very similar. Reasons for dispar- 
ity may include different calibration procedures for relat- 
ing fluorescence intensity to a cellular concentration (mea- 
sured in copies of transcripts per cell) or different pro- 
tocols for harvesting and reverse- transcribing the cellular 
mRNA. 

We then merge and average the data to create a new 



reference set V as follows: 

If Vi and Y- t are both defined for »ene * and — — < a 

Yi + Vi 

Then V, = i(Yi + Vi) 

FJse if only Y t exists, V; = Y t 

Else Vi = i/ f . , 

As presented above, where only one data set has a value 
for the corresponding ORF, we incorporated that value 
and did not exclude it. When both data sets have values 
for an ORF. we averaged the values if they were within 
)5% of each other; otherwise, we just stayed with the 
original chip data set Ui. We used a = 15% in order to 
prevent outliers from skewing the result. This 15% value is 
a reasonable threshold for excluding outliers though other 
values (e.g. 10 or 20%) would give similar results (data 
not shown). Other data sets are subsequently included in 
ihe same procedure, continuing the iteration from the new 
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expression values V t . The initial iteration starts with the 
Young Expression Set, as l/,-, since we have the highest 
confidence in its accuracy. 

The SAGE data (Velculescu et ai t J997) was not 
included in the above procedure syrce it is of a funda- 
mentally different nature. An advantage of the SAGE 
technology over Gene Chips is that there is no possible 
signal saturation for high expression levels, as is possible 
for chips (Futcher et a/., 1999). Conversely, SAGE values 
are less . reliable for lowly expressed genes since there 
is a chance that one might not sequence a SAGE tag 
corresponding to such a gene altogether. Therefore, if 
after the last iteration, the average Gene Chip expression 
level Vi was both above a certain threshold ft and below 
the SAGE expression level S-, for the same gene, it was re- 
placed with the SAGE value; otherwise the average Gene 
Chip value was kept. This gave us our final expression 
set w mRNA . Our treatment of the SAGE data is modeled 
after that in Futcher et ai (1999), and like them, we used 
£=16. 

This incorporation of the SAGE data into the reference 
data set ensures that the highly expressed outliers are as 
accurate as possible. 

Rather than plain arithmetic averaging, this overall scal- 
ing procedure with the or cutoff avoids 'artificial averages' 
that combine very different values for a particular gene. 
Some expression values might be statistical outliers. In 
addition, it may be possible that the expression levels of 
a variety of genes can only be within mutually exclusive 
ranges or modes, such as when two alternative pathways 
are switched on or off. Simply averaging these would give 
values that are less representative of the particular mode' 
values. This situation is analogous to that in averaging 
together an ensemble of protein structures (i.e from NMR 
structure determination). Each structure could be stereo- 
chemical^ correct, with all side-chain atoms in predefined 
roiamer configurations. However, an average of all struc- 
tures could yield one that is stereochemically incorrect if 
this involved averaging over particular side- chains in dif- 
ferent rotarneric stales. 

With regard to our regression analysis, we have investi- 
gated both non- linear and linear fits but found a non-Ii near 
procedure to be more advantageous. The non-linear rela- 
tionship between different expression data sets perhaps 
reflects saturation in one or more of the Gene Chips — not 
an uncommon phenomenon. This non-linearity is imme- 
diately evident on scatter plots of two data sets against 
one another (see website). Accordingly, the non- linear 
fit produces a smaller residual than the linear fit: 98297 
(non-)ineai) veisus J 22 182 (linear) lor the scaling of the 
Church data set and 59828 (non-linear) versus 67 462 
(linear) for the Samson data set. 



A reference set for protein abundance. We followed a 
similar procedure to calculate a reference protein abun- 
dance set from the two gel electrophoresis data sets. We 
first scaled the two data sets against the mRNA expres- 
, si on reference data set, getting regression parameters Cj 
and Dj : 

i 

where the subscript j indicates the data set 2-DE #1 or 
2-DE #2 respectively; P t j is the protein abundance value 
in data set j y and u/ m RNA,; the corresponding reference 
expression value, and Cj and Dj are the parameters of 
the non-linear regression. 

Using these parameters, we transformed the values of set 
2-DE #2 onto 2-DE #1. Then we combined both sets into 
the reference protein set Wproi by averaging them, if both 
values existed. Otherwise, by using the existing value, viz: 

C ,,.c,(£) 

wpiou ■= (Pi j 4- Q\,i)f2 if both P { j and Q i 2 exist. 
Else if only P i j exists, u>p r0 i./ = Pij 
Else if Q, 2 exists, vjpj 0t .i ~ Qi,2> 

Enrichment of features 

Formalism. In the next part of our analysis, we want 
to group a number of proteins together into various 
categories based on common features and characterize 
those features that are enriched in one population relative 
to another, i.e. the translatome population of proteins 
as measured by 2D gels relative to the transcriptome 
population of transcripts or the genome population of 
genes. To this end, we set up a formalism that could 
be applied universally to all the attributes that we were 
interested in. Due to the limitations of the experiments, 
the translatome, transcriptome, and genome populations 
are defined on different sets of genes, and sometimes we 
want to remove this 'selection bias' by forcing them to be 
compared on exactly the same set of genes. This is a key 
aspect of our formalism as presented in Figure 1 . 

We call an entity like |w, G] a 'population; where G 
is a set describing a particular selection of genes from the 
genome and w is vector of weights associated with each 
element of this population. In particular, we focus on three 
main populations here: 

0) I J r GcenJ is the population of genes in the genome, 
all 6280 genes weighted once (w ~ J); 

(ii) h v mRNA: G mRNA] is the observed population of the 
transcripts in the transcriptome, i.e. the 6249 genes 
in the reference expression set weighted by their 
reference expression value: 
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(iii) [wp^, (?p,oil is ihe observed cellular popuJat ion of 
the proteins in the transiatome, i.e. the 181 genes 
in the reference abundance set weighted by their 
reference abundance value. 

(The set of genes in the genome Go* is approximately 
equal to the genes in set G mRNA , such that we can use 
both symbols interchangeably.) We can also use this nota- 
tion to describe specific experiments— e.g. [\v l3c2 , G hcZ ] 
describes the gene set and weights relating to the transpo 
son abundance set. 

Furthermore, we define Fj as the value of a feature F 
in ORF j. For example, F could be the composition 
of leucine (a real number) or a binary value (0 or ]) 
indicating whether an ORF contains a trans-membrane 
segment. Given these definitions, the weighted average of 
feature F in population [w, G] is: 



On the other hand, when computing the average molec- 
ular weight per amino acid, we need to normalize by the 
number of amino acids per ORF, which is equivalent to 
choosing the following weights: 



^ [w, G]) 



The weighted averages of iwo populations [w, G] and 
[v, S] can be compared by simply looking at their relative 
difference A: 

A(7\ fv, SI [w G}) = ^ v ^W(F,[w ? G}) 
M(/\fw,G)) 

where v and w are weights for the sets of ORFs S and G 
respectively. We call A the 'enrichment' of feature F 
because ii indicates whether F . is enriched (if A is 
positive) or depleted (if A is negative) in population f v S] 
relative lo [w, G J. 

Usually, the gene set G is defined by ihe particular 
.experiment, for which the weight w was measured. 
However, it is also possible lo combine the gene set 
associated with one experiment with expression levels 
from another set. One may want to do this to compute 
(he enrichment only on the genes common to both 
populations, for which there are defined values for both w 
and v, viz: A(f\ \v, S n G}. \ w , $ n Gj). In practice, 
this is most relevant for comparing G Vin and G mRNA . 
Since G Pror is completely a subset of G mRNA , we need 
not explicitly deal with intersections if we calculate all 
statistics directly over Gp IO ,. 

One can adjust the weight vectors to take into account 
different types of averaging. For instance, when com- 
puting ihe amino acid composition (F = on) from ihe 
amino acid compositions of individual ORFs F } = on- 
(Vy e G). we weight by ORF length. In the' case of 
expression weights, we have: 

where A'; is a measure of the length of OR F / (such as ihe 
number of amino acids). 



Application of methodology to quantitative 
abundance sets 

Having defined our formalism, we applied it to a diverse 
set of protein features in yeast. 

Amino acid enrichment. As shown in Figure 2a, we used 
pur methodology to measure the enrichment of individual 
amino acids in both the transiatome and the transcriptome 
relative to the genome. We found that three amino acids— 
valine, glycine and alanine— were consistently enriched in 
both transcriptome and transiatome populations. 

In Figure 2a we compare different gene Sets. In Fig- 
ure 2b we focus mainly on the variation in enrichments 
when all the comparisons are restricted to the set of 18J 
genes (G Prot n G mRNA = Gp TO( ) common to all data sets. 
Thus, the differences between the populations now only 
reflect the effects of differential transcription of certain 
genes and differential translation of certain transcripts. 
We find here an enrichment specifically of cysteine in the 
transiatome in relation to the transcriptome. 

To measure the statistical significance of the results on 
amino acid enrichment, we have performed a control anal- 
ysis on a randomized data set (Figure 2d). We randomly 
permutated the expression values of the ORFs 1000 times 
and then recomputed Ihe enrichments. This allowed us to 
compute distributions for Ihe amino acid enrichments and, 
from integrating these, one-sided values indicating the 
significance of the observed enrichments. 

Amino acid enrichment in Transposon data set. We 
also Iried to extend our methodology, ineffectively, to 
cope with the semi-quantitative Transposon sei. We used 
only those 450 ORFs that consistently yielded either no 
expression or high expression, as binary data, on or off. We 
show ihe enrichments of amino acids computed from this 
fillered Transposon abundance set in Figure 2a. Overall 
ihe enrichments from this set seemed lo be attenuated in 
comparison to other data. 

Biomass enrichment. A corollary lo amino acid enrich- 
ments is ihe determination of the average biomass of the 
transcriptome and transiatome populations (shown in Fig- 
ure 2c). We found (hat the average molecular weight of 
a protein m both populations was, on average, lower than 
in the genome population. These preliminary observations 
suggesi a ceil preference to use less energetically expen- 
sive proieins for those that aie highly transcribed or lians- 
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Fig. 2. Amino acid and biomass enrichment, (a) Shows (he amino acid enrichments between different populations as indicated by the legend 
to the right of the plot (the legend is ordered in the same way as the schematic illustration in Figure I). The bars indicate the enrichment of the 
transcriptome relative to the genome, whereas the circles indicate the enrichment of the translatome relative to the genome. In addition, we 
also show the enrichment for protein abundance from the Tiansposon abundance set, represented by the circles with the line through them, 
(b) Shows a different view of amino acid enrichment from thai contained in (a), now focusing on changes, and thus restricting the comparison 
to the genes common to all the dala sets. The graph is oidered according to ihe enrichment from transcriptome to translators (black squares). 
We focus here only on the changes for (he abundance gene set (G Prol ) to exclude the effects that arise from looking al different subsets. In this 
. view the enrichments from genome to transcriptome (white squares) and from genome to translatome (while diamonds) look more similar 
than do the analogous sets .n (a). To make comparison with (a) easier we again show the enrichment from genome to the transcriptome for 
the complete gene set (G Ge n. shown in bars), (c) Shows biomass enrichment. The left panel depicts the average molecular weight per ORF 
(in units of kDa) and the right panel, the aveiage molecular weight per amino acid (in units of Dallons) in each of the three slages of gene 
expression. The numbers inside the circles indicale the average molecular weights. The values next lo the arrows indicate the enrichments 
-m biomass between different populations. Both the circle diameters and the ariow widths are functions of ihe corresponding values (the 
hollow arrow indicates a positive value). It is very clear that the average molecular weight per ORF is much lower in the translatome (by 20 
or 15%) and transcriptome (by 29%) than in the genome. This relative depletion of biomass mainly lakes place as a result of transcription; the 
effect of translation is less clear, depending on the populations compared. On the other hand, the depletion in the average molecular weight 
per amino acid (-3.3% fiom genome to translatome) is an order of magnitude smaller lhan in the average weight per ORF. This shows 
thai the yeast cell favors the expression of shorter ORFs over longer ones, and agrees with cur earlier observation that ihere is a negative 
correlation between maximum ORF icngih and niRNA expression (Janscn and Ccrstcin. 2000); it seems that this effect mainly lakes place 
during transcription rather than translation, (d) This plot shows that (he amino acid enrichments are statistically significant. We have assessed 
Significance by randomly permuting the expression levels among the genes and then recomputing (he amino acid enrichments. This procedure 
can be repeated and used to generaie distributions of random enrichments that can then be compared against the observed enrichments. hi 
the plot Ihe gray bars represent the observed enrichments already shown in Figure 3a. On top of ihe gray bars we show standard boxplols 
of enrichment distributions based on 1000 random permutations. (The middle line represents the distribution median. The upper and lowc. 
suies of the box co.ncide wiih the upper and lower quartdes. Outliers are shown as dots and defined as data points that are outside the range 
of the whiskers, the length of which is 1.5 the interquartile distance.) Based on the random distributions, we can compute onesided p- values 
for the observed enrichments. Amino acids for which the values arc less lhan JO -3 are shown in bold font. 



lated. However, we also found that the average molecu- 
lar weight per amino add differed much less beiween the 
transcripicme and the translatome on ihe one hand, and the 



genome on ihe other hand (though it was still slightly less). 
This findmg indicates that lower molecular weights in the 
translatome and transctiptome relative io the cenomc are 
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predominantly due to greater expression of shorter pro- 
teins rather than the incorporation of smaller amino acids. 

Secondary, structure composition. We also used our 
methodology to study the enrichment of secondary- 
structural features. Secondary structural annotation was 
derived from structure prediction applied uniformly to all 
the ORFs in the yeast genome as described in Table J. 
As shown in Figure 3a, all three populations — genome, 
transcriptome, and iranslatorne — had a fairly similar 
composition of secondary structures — sheets, helices, and 
coils. The differences between populations were marginal 
and based only on the small subset of genes. 

We also found that Transmembrane (TM) proteins 
were significantly depleted in the transcriptome (see 
website and caption). These results are consistent with 
our previous analyses (Jansen and Gerstein, 2000). The 
protein abundance data does not have any membrane 
proteins. 

Subcellular localization. Figuie 3c shows the enric li- 
me nt of proteins associated with the various subcellular 
compartments. For clarity, we divided the cell into five 
distinct subcellular compartments, (see Table I). We 
found that, in comparison to the genome, both the tran- 
scriptome and trans la tome are enriched in cytoplasmic 
proteins. This is true whether we make our comparisons in 



relation to the relatively large reference mRNA expression 
set or the smaller reference protein abundance set. As 
Figure 3c shows, the 2D gel experiments are clearly 
biased towards proteins from the cytoplasm. However, in 
the biased subset Cp rot transcription and translation lead 
to an even higher fraction of cytoplasmic proteins in the 
transJatome. 

Functional categories. Finally, we compared the enrich- 
ment of various functional categories in both the trans- 
latome and the transcriptome (see Figure 3b). This gives 
us a broad yet informative view of the cell as a whole. As 
described in Table 1, we used the top-Jevel of the MIPS 
scheme for the functional category definitions. We found 
broad differences between the .various populations, with 
some of the functional categories showing strikingly high 
enrichments. 

DISCUSSION AND CONCLUSION 

We developed: (i) a methodology for integrating many dif- 
ferent types of gene expression and protein abundance into 
a common framework and applied this to a preliminary 
analysis; (ii) a procedure for scaling and merging diffeienl 
mRNA and protein sets together; and riii) an approach for 
computing the enrichment of various proteomic features in 
the population of transcripts and proteins. We showed that 
by analyzing broad categories instead of individual noisy 
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Secondary structure (b) 



Localisation 





Function 



□ Metabolism (1) 
EI Energy (?) 

PlCeH growth. ceB division 
and DNA synthesis (3). 
Q Transcription (4) 
09 Protein synthesis (5) 

□ Cellular organization (30) 
Unclassilied {98. 99) 

DOther 



Fig. 3. Breakdown of the transcriptome and iranslatome in terms of broad categories relating to structure, localization, and function. All 
of the subfigures are analogous to the schematic illustration in Figure 1. (a) Represents the composition of secondary structure in the 
different populations, (b) Represents (he distribution of subcellular localizations associated with proteins in the various populations. We used 
standardized localizations developed earlier (Drawid and Gerstein, 2000). which, in turn, were derived from the MIPS, YPD, and SwissProl 
databases (Bairoch and Apweiler, 2000; Costanzo et a/., 2000; Mewes et ai, 2000). The subcellular localization has been experimentally 
determined for less than half of the yeast proteins, so our analysis applies only to this subset, (c) Shows the division of ORFs into different 
functional categories (according to the MIPS classification) in the various populations. Only the largest functional categories of the top level 
of the MIPS classification arc shown. The group 'other' contains the smaller top-level categories lumped together. This 'other' group is 
different from the gioup 'unclassified,' which contains genes without any functional description. 



data points, we could find logical trends in the underlying 
data. For example, individual transcription factors might 
have higher ot lower protein abundance than one expects 
fiom i heir mRNA expression, but the category 'transcrip- 
tion factors' as a whole has a similar representation in the 
transcriptome and iranslatome. 

We found, as previously described (Futcher et al. : 1999; 
Gygi et nl.. 1999b; Greenbaum et ai, 2001), a weak 
correlation between individual measurements of mRNA 



and protein abundance. The outliers of this correlation 
lend to be associated with cellular organization. One 
might conceive of using these outliers (i.e. those with 
significantly different transcriptional and tra relational 
behavior) to find consensus regulatory sequences. One 
possible method would involve using predicted mRNA 
structures (Jaeger et oi, 1990; Zuker, 2000) to find and 
investigate consensus structural elements in these outliers 
to which the yeasl rranslaiional machinery is known to be 
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Table 1. Data sets 



Description 



Size [ORFs} 



Reference 



mRNA expression 
Young 

Church 

Samson 

SAGE 

Reference expression 
Protein abundance 

■ 2-DE82 
Trans poson • 

Reference abundance 

Annotation 

Annotated localization 

TM segments 

MIPS functions 

COR secondary structure 



Gene chip profiles yeast cells with 
mutations that affect transcription 

. Gene chip profiles of yeast cells 
under four different conditions 

Comparing gene chip profiles for 
yeast cells subjected to alkylating agent 
Yeast cells during vegetative growth 

Scaling and integrating the mRNA 
expression set into one data source 



Measurement of yeast protein 

abundance by 2D gel 

electrophoresis and mass spectrometry 

Similar to 2- DE set #1 . 

Large-scale fusions of yeast genes 
with locZ by transposon insertion 
Scaling and integrating the 2-DE 
data sets into one data source 

Subcellular localizations of yeast 
proteins 

Predicted TM and 
soluble proteins »n yeast 
Functional categories for yeast 
ORFs 

Predicted secondary structure 
yeast ORFs 



5455 

6263 

6090 

3778 
6249 

156 

71 
1410 

181. 



2133 
(6280) 

2710 
(6280) 

3519 
(6194) 

6280 



Holstege*/a/. (1998) 
Roth etal. (1998) 
iciinsky and Samson (1999) 
Velculescu et of. (1997) 

Gygi ffa/.(I999a,b) 

Futcher etal (1999) 
Ross-Macdonald et ol (1999) 



Drawid and Gerstein (2000) 
Gerstein (I998a.b r c) 
Mewes et at. (2000) 
Gerstein (I998a,b.c) 



This table provides an overview of the data sets used in our analysis. The table is divided into three sections. The top section lists different mRNA ex pi ess ion 
sets. The middle section shows the protein abundance data sets used. The bottom section contains diffejenl annotations of protein features. The column Dau 
set' lists a shorthand refeience to each data set used throughout this paper. The next columns contain a brief description of the data sets, the number of ORFs 
contained in each of ihern, and the literature reference. In contrast to the other data we investigated, the reference expression and abundance data sets have 
been calculated for the purpose of our analysis (see text). An expanded version of the table is available on our web site. 
Some further information on the genome annotations: 

Localization. Protein localization information from YPD, MIPS and SwissProt were merged, filtered and standardized (Bairoch and Apwciler, 2000; 
Costanzo et ni, 2000; Mewes et of, 2000) into five simplified compartments — cytoplasm, hucleus, membrane, extracellular (including proteins in ER and 
golgi), and mitochondrial— according to the protocol in Drawid rt al. (2000). This yielded a standardized annotation of protein subcellular localization for 
2133 out of 6280 ORFs.' 

7M segments. In 2710 out of 62S0 yeast ORFs TM segments ajc predicted to occur, ranging from low to high confidence (732 OHFs). The TM prediction was 
ptifonned as follows: ihe values from the scale for amino acids in a window of size 20 (the typical size of a TM helix) were averaged and then compared 
against a cutoff of - I kcal mol" l . A value undei this cutoff was taken to indicate the existence of a TM helix. Initial hydrophobic stretches corresponding to 
signal sequences for membrane insertion were excluded. (These have the pa Hern of a charged residue within the first seven, followed by a stretch of 14 with 
an average hydiophobicity under the cutoff.) These parameters have been used, tested, and refined on surveys of membrane protein in genomes. 'Sure' 
membrane proteins had .it least two TM-si'gincnls with an average hydiophobicity less than -2 kcal mol" 1 (Rest et af., 1995: Gerstein el nl.. 2000. Santoni 
r> nl.. 2000; Senrs ftnt.^ 2000). 

Functions. MIPS functional categories have been assigned to 3519 out of 6194 ORFs. (The remainder are assigned to category '98' or '99/ which 
corresponds to unclassified function.) 



sensitive (McCanhy, 1998). 

Jn relniion to fnnciional categories, we found three 
i remls ihat were particularly notable:- (i) the 'cellular 



organization.* 'protein synthesis/ and 'energy production' 
categories were increasingly enriched as we moved Irom 
genome to transcripiome io translntoine. In ihe transcript 
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tome and translatome population relative to the genome; 

(ii) proteins with 'unclassified function* are significantly 
depleted, perhaps reflecting a bi3s against studying them; 

(iii) proteins in the 'transcription' and 'cell growth, cell 
division, and DNA synthesis' categories were consistently 
depleted. This reflects the fact thai many of these proteins, 
such as transcription factors, act as 'switches' such that 
only small quantities of the protein are necessary to 
activate or deactivate a process. These results concur with 
previous calculations (Jansen and Gerstein, 2000) wherein 
we found the transcriplome is enriched specifically with 
proteins involved in protein synthesis and energy. 

Limitations given the small size of (he protein 
abundance data 

Even with the extended coverage made possible by 
merging many data sets together into reference sets, the 
analysis is still limited by the minimal data. This was 
most applicable to the protein abundance measurements, 
potentially biasing our statistical results towards certain 
protein families. Moreover, the J8J proteins in Cp rc ,i do 
not represent a random sample. They are skewed towards 
highly expressed, well-studied proteins. Our methodology 
attempts to control for this gene-selection bias through 
our enrichment formalism, which allows one to rather 
precisely gauge various aspects of the bias. Conversely, 
many protein features in both the translatome and the 
transcriplome are dominated by highly expressed proteins. 
Under ihese circumstances, it is often sufficient to look at 
this smaller number of dominating proteins to characterize 
the whole population. This is similar to the development of 
the codon adaptation index for yeast (Sharp and U, 1987). 
While based on only 24 highly expressed proteins, it has 
proven to be robust in predicting expression levels for the 
entire genome. 

We believe that the essential formalism and approach' 
that we develop will remain quite relevant for future data 
sets' (Smith, 2000). 
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Thymidme kinase, thymidylate synthase, and dihydropyrimidine 
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Galanin in pituitary adenomas. 

g™ ba «*E > BMSOU1P, Wallerman E, LundbladL . Anggard A Ericson K, Aman 
K, Landr^M, Schmidt WE . rTnl i fclfT H^^~ ~ ^ ' ^cj>on_K, Aman 

??7^? °J. M f M««i«nc» Endocrine and Diabetes Unit, Karolinska Hospital 
b-I /1/6 Stockholm, Sweden. Eva.Grenback@ks.se 

Tumor galanin content was measured in extracts from human pituitary adenomas using a 
specific RIA method for monitoring human galanin. Twenty-two out of twenty-four 
tumors contained galanin with notably high levels in corticotroph adenomas, varying 
levels ,n clinically inactive tumors, and low levels in GH. secreting adenomas. Tumor 
ga anm and ACTH contents were closely correlated in all tumors. In four young patients 
wiUi microadenomas and highly active Mb Cushing tumor galanin was inversely related 
to tumor volume. The molecular form of tumor galanin, studied with reverse-phase 
HI LC, was homogeneous with the majority of tumor galanin coeluting with standard 
human galanin. In the tumors analysed with in situ hybridization there was a good 
correlation between galanin peptide levels and galanin mRNA expression In some 
tumors galanin mRNA and POMC levels coexisted, in others they were essentially in 
different cell populations. Levels of plasma galanin-LI were not related to tumor galanin 
concentration, and galanin levels were in the same range in sinus petrosus close to the 
pituitary venous drainage as in peripheral blood. Corticotropin releasing Hormone 
injections in two patients caused ACTH, but no detectable galanin release into sinus 
petrosus^ Our results demonstrate that corticotroph, but not GH adenomas, express high 
levels of galanin, in addition to ACTH, and that in some tumors both polypeptides are 
synthesised in the same cell population. However, galanin levels in plasma were not 
intluenced by the tumor galanin content. 
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Transcript profiling of human platelets usiug microarray and serial 
analysis of gene expression. 

Gnatcnko DV , DmmJJ, McCorkle SR . Wcissmann D Perrotta PL . Bahou WF . 

Department of Medicine, Program in Genetics, State University of New York Stonv 
Brook 1 1794-8151, USA. y 

Human platelets are anucleate blood cells that retain cytoplasmic mRNA and maintain 
functionally mtact protein translation^ capabilities. We have adapted complementary 
techniques of microarray and serial analysis of gene expression (SAGE) for genetic 
profiling of highly purified human blood platelets. Microarray analysis using the 
Affymetrix HG-U95Av2 approximately 12 600-probe set maximally identified die 
expression of 2147 (range, 13%-17%) platelet-expressed transenpts, with approximately 
ll/o collectively involved in metabolism and receptor/signaling, and an 
overrepresenlation of genes with unassigned function (32%). In contrast, a modified 
SAGE protocol using the Type IIS restriction enzyme Mmel (generating 21 -base pair 
[bp] or 22-bp tags) demonstrated that 89% of tags represented mitochondrial (mt) 
transenpts (enriched in 16S and 12S ribosomal RNAs), presumably related to persistent 
mt-rransenphon m the absence of nuclear-derived transcripts. The frequency of non-mt 
SAGE tags paralleled average di fference values (relative expression) for the most 
"abundant" transcripts as determined by microarray analysis, establishing the 
concordance of both techniques for platelet profiling. Quantitative reverse transcription- 
polymerase chain reaction (PCR) confirmed the highest frequency of mt-derived 
transenpts, along with the mRNAs for neurogranin (NGN, a protein kinase C substrate) 
and the complement lysis inhibitor clusterin among the top 5 most abundant transcripts 
For confirmatory characterization, immunoblots and flow cytometric analyses were 
performed, establishing abundant cell-surface expression of clusterin and intracellular 
expression of NGN. These observations demonstrate a strong correlation between high 
transenpt abundance and protein expression, and they establish the validity of transcript 
analyses as a tool for identifying novel platelet proteins that may regulate normal and 
pathologic platelet (and/or megakaryocyte) functions. 
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Transcript profiling of human platelets using microarray and serial analysis 
of gene expression 

Dmitri V. Gnatenko. John J. Dunn, Sean R. McCorkle, David Weissmann, Peler L. Perrotla, and Wadie F. Bahou 



Human platelets are anucleate blood cells • 
that retain cytoplasmic mRNA and main- 
tain functionally intact protein transla- 
tional capabilities. We have adapted 
complementary techniques of microarray 
and serial analysis of gene expression 
(SAGE) for genetic profiling of highly 
purified human blood platelets. Microar- 
ray analysis using the Affymetrix HG- 
U95Av2 approximately 12 600-probe set 
maximally identified the expression of 
2147 (range, 13%-17%) platelet-expressed 
transcripts, with approximately 22% col- 
lectively involved in metabolism and re- 
ceptor/signaling, and an overrepresenta- 
tion of genes with unassigned function 
(32%). In contrast, a modified SAGE proto- 
col using the Type IIS restriction enzyme 

Introduction " 



Mmel (generating 21-base pair [bpj or 
22-bp tags) demonstrated that 89% of 
tags represented mitochondrial (mt) tran- 
scripts (enriched in 16S and 12S ribo- 
somal RNAs), presumably related to per- 
sistent mt -transcript ion in the absence of 
nuclear-derived transcripts. The fre- 
quency of non-mt SAGE tags paralleled 
average difference values (relative expres- 
sion) for the most "abundant" transcripts 
as determined by microarray analysis, 
establishing the concordance of both 
techniques for platelet profiling. Quantita- 
tive reverse transcription-polymerase 
chain reaction (PCR) confirmed the high- 
est frequency of mt-derivod transcripts, 
along with the mRNAs for neurogranin 
(NGN, a protein kinase C substrate) and 



the complement lysis inhibitor clusterin 
among the top 5 most abundant tran- 
scripts. For confirmatory characteriza- 
tion, immu no blots and flow cytometric 
analyses were performed, establishing 
abundant cell-surface expression of clus- 
terin and intracellular expression of NGN. 
These observations demonstrate a strong 
correlation between high transcript abun- 
dance and protein expression, and they 
establish the validity of transcript analy- 
sis as a tool for identifying novel platelet 
proteins that may regulate normal and 
pathologic platelet (and/or megakaryo- 
cyte) functions. (Blood. 2003;101:2285-2293) 
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Human blood platelets play critical roles in normal hemostatic 
processes and pathologic conditions such as thrombosis, vascular 
remodeling, inflammation, and wound repair. Generated as cytoplas- 
mic buds from precursor bone marrow megakaryocytes, platelets 
are anucleate and lack nuclear DNA, although they retain 
megakaryocyte-derived mRNAs. '• 7 Platelets contain rough endo- 
plasmic reticulum and polyribosomes, and they retain the ability 
for protein biosynthesis from cytoplasmic mRNA. 3 Quiescent 
platelets generally display minimal translational activity, although 
newjy formed platelets such as those found in patients with 
immune thrombocytopenic purpura (ITP) synthesize various a-gran- 
ule and membrane glycoproteins (CPs), including GPlb and 
GPJlb/llla (ojjbpj). Furthermore, stimulation of quiescent platelets 
by agonists such as o-thrombin increases protein synthesis of 
various platelet proteins, including Bcl-3. 4 Like nucleated cells, the 
rapid translation of preexisting mRNAs may be regulated by 
integrin ligation to extracellular matrices. 5 In the case of platelets, 
the primary integrin involved in. this process appears to be a nb L3 3 
with cooperative signals mediated by the collagen receptor ri : p,. 6 - 7 



Jntegrin-mediated platelet protein synthesis appears to be regulated 
at the level of translation initiation involving the eukaryotic 
initiation factor 4E (eIF4E). Jnstead of directly influencing elF4E 
activity via posttranslational modifications (ie, phosphorylation), 
platelet e!F4E activity best correlates with its spatial redistribution 
to the mRNA -enriched cytoskeleton. 8 Furthermore, because pro- 
tein translation is partially inhibited by the immunosuppressant 
rapamycin. it suggests that adhesion- and/or aggregation- induced 
outsidc-in-signaling function to regulate protein synthesis through 
the mTOR (mammalian target of rapamycin) pathway. 6 -*- 9 

Despite the biologic importance of platelets and their intact 
protein synthetic capabilities, remarkably little is known about 
platelet mRNAs. Younger platelets contain larger amounts of 
mRNA with a greater capacity for protein synthesis, as detennined 
by using fluorescent nucleic acid dyes such as thiazole orange. 10 
This assay has been used as a quantitative determinant of younger 
or "reticulated" platelets (RPs). Indeed increased reticulated plate- 
lets are typically found in patients with conditions associated with 
rapid platelet turnover such as ITP; typically RP percentages in 
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such patienls approach 10% to 20% of all platelets, considerably 
higher than in healthy control subjects. n Interestingly, high RPs 
have, been associated with enhanced thrombotic risk when identi- 
fied in patients wilh thrombocytosis, 10 suggesting that quantita- 
tively increased mRNA levels may be associated with the prothrom- 
botic phcnotype. Whether this is relateoVto globally altered gene 
expression profiles or to select changes more evident during 
situations of rapid platelet turnover remains unknown. Certainly, 
technical limitations of this assay limit its utility in defining 
prothrombotic genotypes/ 012 and it cannot identify differentially 
expressed genes that may be causally implicated in disordered 
platelet pbenotypes. 

Toward the goal of defining the molecular anatomy of the 
platelet genome, we have adapted complementary techniques of 
microarray and serial analysis of gene expression (SAGE) for 
genetic profiling of highly purified human blood platelets. Microar- 
ray technology represents a "closed" profiling strategy limited by 
the target genes imprinted onto gene chips. In contrast, SAGE is an 
"open" architectural system that can be used to identify novel genes 
and to quantify differentially expressed mRNAs. ,3_,5 The sequence 
of each tag along with its positional location uniquely identifies the 
gene from which it is derived, and differentially expressed genes 
can be identified in a quantitative manner because the tag frequency 
reflects the mRNA level at the time of cellular harvest and analysis. 
By using both technologies, we have identified a number of 
previously uncharacterized genes that appear to be expressed in 
human platelets, while simultaneously establishing the dominant 
frequency of mitochondrial-expressed genomes comprising the 
platelet mRNA pool. These observations provide a panoramic 
overview of the platelet transcriptome, while additionally provid- 
ing insights into the molecular pathways regulating platelet (and/or 
megakaryocyte) function in normal and pathologic conditions. 



Materials and methods 

Reagents and supplies 

Thermm aifuoiicus (Ttiq) polymerase was purchased from (Roche, Indianap- 
olis, IN). T'l DNA ligase was purchased from Inviirogen (Carlsbad, CA), 
and restriction enzymes were from New England Biolabs (Beverly. MA), 
except for Mine), which was obtained from the Center for Technology 
Transfer (Gdansk, Poland). All oligonucleotides were synthesized on an 
Applied Biosysiems (Foster City, CA) 3-channeI synthesizer and are listed 
in Table 1. Monoclonal antibodies used for flow cytometric analysis 
included the F1TC (fluorescein isothiocyanate)-conjuga[ed anti-CD'W 
(ohbPj) immunoglobulin Gl (IgGI; Immunolecb, Miami. FL); phyco- 
erythrin (PE)-conjui»aicd antiglycophorin (lgG2; Becton Dickinson Fharm- 
ingen, San Dicgo^ CA); and peridinin chlorophyll protein (PERCP)- 
conjucated anti-CD45 (IgGI: Becton Dickinson Pharmingen). 

Platelet isolation, purification, and immunodetection 

All human subjects provided informed consent for an IRB (lnsiiiuiion.il 
Review BoardV-appioved protocol completed in conjunction with the 
General Clinical Research Center at Stony Brook University Hospital. 
Peripheral blood (20 niL) from healthy volunteers drawn into 2 mL of 4% 
sodium citrate t0.4% vol/vol final concentration) was used to isolate 
erythrocytes by differential centrifugal ion O500g) or to isolate pure 
leukocytes by density- gradient centrifugal ion as previously described."* 
Platelets collected fioni healthy volunteers by a phoresis were used within 
24 hours of collection. A tier addition of 2 mM EDTA (ethylcncdinmineict- 
raacetic acid), aphe'iesis- derived platelets from a single donor were 
cenirifuged at \40y for 15 minutes at 25°C. To minimize leukocyte 
contamination, only me upper .9/10 of the plalefcl-rich plasma (PRP) was 
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used for gel filtration over a BioGel A50M column (1000 mL total vohime) 
equilibrated with HBMT (HEPES-buffered modified Tyrodes buffer. 10 
mM HEPES (W^-hydroxyeth'ylpiperazine-AT^-ethanesnlfonic acid) pH 
7.4, 150 mM NaCL 2.5 mM KCL 0.3 mM NaH 2 P0 4> 12 mM NaHCQ,, 
0.2% bovine serum albumen [BSA], 0.1% glucose, 2 mM EDTA). 
Gel-filtered platelets (GFPs) were subsequently filtered through a 5-pjn 
nonwelting nylon filament filter (BioDesign, Carmel, NY) at 25°C and 
harvested by centrifugation al I500g for 10 minutes at 25°C. Platelets were 
gently and thoroughly resuspended in 10 mL HBMT buffer and incubated 
with 120 pX murine monoclonal anti-CD45 antibody conjugated to 
magnetic microbeads (Miltenyi Biotec, Bergisch Gladbach, Germany) on a 
rotating platform for 45 minutes at 25°C. Magnelic separation columns 
were used to capture CD45* cells (leukocyte fraction) by positive selection 
. (MACS II; Miltenyi Biotec). Purified platelets were concentrated by 
centrifugation at 1500g and immediately used for total RNA isolation. 

The efficiency of platelet purification was documented al each step by 
flow cytometry." Briefly, aliquots containing 2 X I0 6 platelets were 
incubated with saturating concentrations of FITC-conjugaled anti-CD41, 
PE-conjugaled antiglycophorin, and PERCP- conjugated antj-CD45 for 15 
minutes in the dark at 25°C, washed with phosphate-buffered saline (PBS), 
and fixed in PBS/1% formalin. Samples were analyzed using a FACScan 
(fluorescence- activated cell sorter scan) flow cytometer (Becton Dickinson) 
using CELLQuest software designed to quantify the number of CD45 + and 
glycophorin- positive events in the sample (expressed as the number of 
events per 100 000 CD41 + events). For some experiments, fixed platelets 
were permeabilized with 0. 1% Triton-X/PBS for 30 minutes at 25°C prior 
to the addition of primary antibodies, all as previously described. 17 ' 

Platelet protein detection was completed by sodium dodecyl sulfate 
(SDS)-polyacrylamide gel electrophoresis (PAGE) and immunoblot analy- 
sis as previously described, using the species- specific horseradish pcroxidase- 
conjugated secondary antibody and enhanced chemiluminescence. 16 Anti- ' 
bodies included the anticlusterin monoclonal antibody (Quidel, Santa Clara, 
CA; 1:1000 primary and 1:10 000 secondary) and the antineurogranin 
rabbit polyclonal antibody (Chemicon International, Temecula, CA; 1:1000 
primary and 1:10 000 secondary). 

Molecular analyses and microarray profiling 

Purified, individual cell fractions were resuspended in 10 mLTrizol reagent 
(Inviirogen), transfened into diethylpyrocarbonatc (DEPCHreated Corex 
(Springfield, MA) tubes, and serially purified ;md precipitated by using 
isopropanol essentially' as previously described.'* Total cellular RNA was 
harvested by centrifugation at 1,2 500g for 20 minutes at 4°C, washed 2 
times with 75% ethanol (10 mL/tube), and resuspended in 100 pX 
DEPC-treated water. Platelet mRNA quantitation was performed by using 
fluorescence-based real-lime PCR (polymerase chain reaction) technology 
(TaqMan Real-Time PCR; Applied Biosyslems, Foster City. CA), Oligonu- 
cleotide primer pairs were generated by using Primer 3 so fl ware (www- 
genome. wi.mit.edu), designed lo generate approximately 200-base pair 
(bp) PCR products al the same annealing temperature, and are outlined in 
Table I. Purified platelet mRNA (4 u.g) was used for first-strand cDNA 
synthesis using oligo(dT) and Superscript I) reverse transcriptase (Invitro- 
gen). For real-time reverse transcription |RT)-PCR analysis, the RT 
reaction was equally divided among primer pairs and used in a 40-cycle 
PCR reaction for each target j;ene by using the following cycle: 94°C for 30 
seconds, 55"C for 30 seconds, 72°C for 1 minute, and 7J°C for iO seconds 
(40 cycles lolal). mRNA levels were quantified by monitoring real-time 
fluorometric intensity of SYBR green 1. Relative mRNA abundance was 
determined from triplicate assays performed in parallel for each primer pair 
and was calculated by using the comparative threshold cycle number (A-Ct 
method) as previously described. 111 

Gene expression profiles were completed by using the approximately 
12 600-probe set HG-U95Av2 gene chip ("Affymeirix. Santa Clara, CA)." 
Total cellular RNA (5 p.g) was used foi cDNA synthesis by using 
SupVr Script Choice system (Life Technologies. Rock vide. MD) and an 
olieo(dT) primer containing the T7 polymerase recognition sequence 
t Primer Si; Table 1), followed by cDNA purification using GFX spin 
columns. In vitro transcription was completed in the presence of biotinyl- 
ated ribonucleotides by using a BioArray High Yield RNA Transcript 
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Table t. Oligonucleotide primers 









Nucleotide 


Primer 


direction 


Sequence (5' - 3') 


Position 


SI 


pligo(dT) 


5 ' - Bit - GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG- ( dT) 2 « -3 ' 
- ■ — ■ ■' — 




Cassette A 


SAGE 


5 ' - TTTGGATTTGCTtXrrCGAGTACAACTAGGCTTAATCCGACATG - 3 ' 


— 
— 






3 ' - • CCTAAACGACCAGCTCATCTITGATCCGAATAAGGCTp- 5 1 




Cassette B 


SAGE 


5 ' - pTTC ATGG CGGA G A CGTC CGC C A CT AGT GTCG CAA CTG A CT A * - 3 ' 


— 






3'-rWAAGTACCGCCTCTGCAGGCX5GTGATCACAGCGTTGACTGAT- 5' 
■4 ■ ■ ■ 




S2 


SAGE 


5 ' - Bn - GGATTTGCTGGTCGAGTACA- 3 ' 


— 


S3 


SAGE 


5'-En-TAGTCAGGTGCGACACTAGTGGC- 3' . 


— 


GP4 


Glycoprotein IIB [F) 


5 ' - AGGGCTTTGAGAGACTCATCTGTA -3' 


2094^2117 


GPS 


Glycoprotein IIB |R) 


5 ' - ACAATCTTGCltrnTGGATTCTG- 3 ' 


.2301-2279 


GP6 


Glycoprotein DtA |F) 


5 ' - TATAAAGAGGCCACGTCTACCTTC - 3 ' 


2335-2358 


GP7 


Glycoprotein IIIA (R) 


5 ' - CACTTCCACATACTGACATTCTCC- 3 ' 


2532-2509 


PAR18 


PARI IFJ 


5 ' - AATGTCAGTTCTGATATGGAAGCA- 3 ' 


2585-2608 


PAR 19 


PARI [R| 


5 ' - CCCAAATGTTCAAACTTCTTTAGC - 3 ' 


277G-2753 


SR8 


16SrRNA |F] 


5 ' - TGCAAAGGTAGCATAATCA CTTGT - 3 ' 


2586-2609 


SR9 


16SrRNAIRl 


S ' - GTTTAGGACCTGTGGGTTTGTTAG - 3 ' 


2785-2762 


NADH10 


NADH2 |F| 


5 ' - CTAGCCCCCATCTCAAATCATATAC- 3 ' 


4875-4898 


NADH11 


NADH2 JR| 


5 ' - AATGGTTATGTTAGGGTTGTACGG - 3 ' 


5075-5052 


THYM12 


Thymosin p4 [FJ 


5 ! - AAGACAG AGAOGCAAG AG AAAAAT - 3 ' 


135-158 


THYM13 


Thymosin p4 |R] 


5' - GCAGCACAGTCATTTAAACTTGAT- 3 ' 


336-313 


CLUS14 


Clusterrn |FJ 


5 ' - CCAA CAG AATTCATACGAGAAGG - 3 ' 


► 1006-1028 


CLUS15 


CJusterin JRJ ■ 


5 ' - CGTTATATTTCCTGGTCAACCTCT - 3 ' 


1222-1199 


NRG16 


Neurograntn (F) 


5 ' - GCCC'J l 1 1 AGTTAGTTCTGCAGTC- 3 ' 


1351-1374 


NRG 17 " 


Neurogranin fRJ 


5 ' - TTTTCTTT AAGTGAGTGTGCTTGG - 3 ' 


1567-1544 




T-cel) receptor 






TCR1& 


B-chain |F) 


S ' - CCA CAA CTATGTTTTGGTATCGT- 3 ' 


131-153 




T-cell receptor 






TCR19 


B-chain |Rj" 


5 ' - CTAGCA CTGCAG ATGTAGAAGCT - 3 ' 


332-310 


CD4520 


CD45 (F) 


S ' - GCTC AG AATGGA CAAGTA - 3 ' 


3771-3788 


CD4521 


CD45 |R| 


5 ' - C ACA CC CA TACACA CATACA - 3 ' 


42804261 



(FJ indicates forward (sense) strand; jRJ, ieverse (antisense) sirand; Bn, biotin; p, a phosphorylaierf 5' end (cassettes A and B); underlining, Nfo\ It sites in cassettes A and B; 
arrows, corresponding sequence tor S2 and S3 wilhin cassettes A and B. respeclivery; bold, the Mmel site; and N. A, C, 7. or G. nucleotide position based on the following 
accession numbers: glycoprotein IIB (J02764). glycoprotein UIA (M35999). PARI (M62424). 16S rRNAand NADH2 (NC_ 001807). thymosin 34 (M17733). cluslerin (M25915). 
neurogranin {X99076), TCR B-chain (AF 043 182), CD45 (Y00638). 

"Indicates an omino-rnodified 3' end in both cassettes; — , not applicable. 



Labeling Kit (Enzo Diagnostics, Farmingdale, NY), and, after metal- 
induced fragmentation, J 5 u>g biotinylaied cRNA was hybridized lo the 
MG-U95Av2 oligonucleotide probe array for 16 hours at 45°C. After 
washing, the cRNA was detected with streptavtdm-phycoerylhrin (Molecu- 
lar Probes, Eugene, OR) and analysis was completed by using a Hewlett- 
Packard Gene Array Scanner (AfTymcirix). The average difference value 
(AD) for each probe set was quantified using MAS 4.01 software 
I A ffymel ris), calculated as an average of fluorescence differences for 
perfectly matched versus single-nucleotide mismatched 25-mer oligonucle- 
otides (16 to 20 oligonucleotide pairs per prnbe set). The software is 
designed to exclude "positive calls" in the presence of high average 
differences with associated high mismatch intensities. 

SAGE profiles 

Platelet SAGE libraries were generated essentially as previously de- 
scribed.'-" modi lied as outlined in Figure I for ihe use of Mme\ as ibe 
lagging enzyme." This type IIS restriction enzyme cleaves 20 of 18 bp past 
Ms nonpalmdroniic (TCCRAC) recognition sequence, thereby generating 
longer lags or 2?-mer) than those obtained using Bsmfl as the standard 
tagging en?yinc (13-14 bp tags). These longer Mmel-generatcd tags 
potentially provide for more definitive "tag- io- gene * identification and are 
particularly useful in characterizing expression patterns in the absence of 
complete genomic sequence data (comprehensive methods detailed in Dunn 
cl al l? ). Briefly. poly(A) mRNA was isolated from 1 0 ng lora) platelet RNA 
using i lie oligo-idT) ST primer conjugated to magnetic beads (Dynal 
Biotech. Lake Success. NY), followed by cDNA synthesis using Super- 
script I) reverse transcriptase (Inviirogen). The cDNA was then digested 



with the restriction enzyme AValll (anchoring enzyme), ligated to cassette A 
using. T4 DNA ligase, and r after the beads were extensively washed, the 
cDNA was digested with AfWJ to release the tags from the beads. After 
purification, lags were ligated lo degenerate cassette B linkers (specifically 




Figure 1. Schema outlining the modified SAGE protocol used in platelet 
anatyies. The final tags ore flanked by the Atolll (anchoring enzyme) CA1G 
sequence, thereby providing tao-to-gene identification when exported lo a relational 
database (refer to "Bioinloimalic analyses* and Table 1 for details). 
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designed to anneal lo the nonuniform Mmel overhangs), and PCR-amplified 
using biotinylated primers S2 and S3 for 30 cycles (95*0 for 30 seconds; 
. 58°C for 30 seconds; 72°C for 30 seconds) using Platinum Taq DNA 
polymerase (Gibco BRL). A fraction (20%) of the pooled PCR products 
were then subjected lo one round of linear amplification using primer pair 
S2/S3, followed by a second round of 25 amplifications using primer S2 
alone (95°C for 30 seconds, 58°C for 30 seconds, 72°C for 30 seconds).* 
Primer S3 was 'subsequently added for one cycle (95°C for 2.5 minutes,, 
58°C for 30 seconds, 72°C for 5 minutes); the latter steps were collectively 
adapted to exclude beteroduplex fonnation. 1 * Unincorporated primers were 
removed by incubation with 200 U Escherichia coti exonuclease 1 for 60 
minutes at 37°C. PCR products were then pooled and digested with Ma]\\ 
to release tags, and biotinylated linker arms were cleared using streptavid in- 
coated immunoaffmiiy magnetic beads (DynaJ Biotech). Tags were con- 
catamerized using 5 uVuX T4 DNA ligase, and products more than 100 bp 
were isolated by size- fractional ion in low-melting agarose gels. The DNA 
was purified by GFX spin columns, and the concaiamers were cloned into 
the Sph\ site of pZero (Invilrogen). After transformation into Ecofi TOP10 
cells, recombinant clones were isolated and sequenced in 96- well microliter 
plates using, an AB1 377 sequencer and ABI Prism BigDye terminator 
chemistry (Perkin- Elmer Applied Biosystems. Branchburg, NJ). . 




g Ske (bp) a 



-603- 




Figure 2. Determination of platelet purity. (A) Total cellular RNA (1.8 ^g) from 
platelet-rich plasma (PRP) or purified platelets from a single apheresis donor were 
analyzed by RT-PCR (35 cycles) using oligonucleolide primers specific (or glycopro- 
tein lib (GPIIb), T-cell receptor p-chain fT CRp), or CD45; 10 pi ot the 50 p. I reactions 
were analyzed by elhidium-stained agarose gel electrophoresis. Minimal to no TCR0 
gene producl was visually evident only in PRP. Size markers corresponding to 
HaellJ-restricled $X174 DNA are shown. (B) Real-time RT-PCR was completed by 
using 1.8 jig total RNA and TCRp-specific oligonucleotide primers optimized tor 
quantitative analysis by real- lime PCR.' 8 On the basis of parallel determinations 
using RNA isolated from known amounts of purified leukocyte standards, the 
leukocyte-depletion protocol represents an approximate 2.5-log purified ton from the 
starting PRP. Results are representative of one complele sel ot experiments repeated 
on 2 separate occasions, and data points represent the mean from triplicate wells, 
with standard errors of the mean (SE M) less than t % (not shown). 



Bioinformatic analyses 

Functional grouping of genes determined lo be present by Affymeirix MAS 
4.0) software was performed using a dChip program linked to the National 
Center for Biotechnology LocusLink, which is an annotated reference 
database for genes and their postulated functions. 70 Of the approximately 
12 600-probe sets represented on (he AfTymeirix HG-U95Av2 Gene chip, 
functional annotations exist for approximately 8100 with the remainder 
categorized as unknown. MicroarTay data were visualized and analyzed 
using BRfJ- Array Tools software (Version 2.1), kindly developed and 
provided by Dr Richard Simon and Amy Peng (rrnus.nci.nih.gov/BRB- 
ArrayTooIs.hlnil). A logarithmic (base 2) transformation was applied lo ihe 
average difference values for individual data sets for determination of 
microarray concordances. Discordancy was defined as a 2-log difference in 
the maximum log intensities between individual experiments. 

SAGE tags were extracted by using in-bouse SAGE software uniquely 
modified to identify Afme) tags. The software ensures lhat only unambigu- 
ous 21- to 22-bp tag sequences are extracted for transcript profiling. Tags 
with ambiguities (Ns), lengths other than 21 or 22 bp, or with ambiguous 
orientations were extracted to separate files for manual editing or further 
examination. Finalized data were exported to a relational database for lag 
quantification and genetic identification. :tr 



Results 

Platelet purification 

To ensure that the RNA profiles accurately represented those of 
circulating blood platelets, a number of complementary methods 
were implemented lo remove contaminating nucleated leukocytes. 
Purification methods incorporating gel filtration, a 5-pm leukocyte 
reduction filler, and magnetic CD45 immunodeplction allowed for 
Ihe cumulative enrichment of highly purified platelets. The efficacy 
of this purification method was initially established by using 
peripheral blood platelet-rich plasma as' the starting material. The 
final product contained no more than 3 to 5 leukocytes per I X I0 5 
platelets as determined by parallel- flow cytometric analysis, 
representing an approximate 4 50- fold i eduction of nucleated 
leukocytes. These results eon elated well with molecular evidence 
for leukocyte depletion as detci mined by RT-PCR using both CD45 
and T-cell receptor G- chain (TCR£) primers (see Figure 2). 
Because the total RNA yield from peripheral blood platelets was 
insufficient for micioanay studies, we adapted the protocol to 
platelet apheresis dono.-s with nearly identical final purity (Ficure 



2). The platelet recovery was nearly 65% of the starting material, 
yielding approximately 2.3 X 10" platelets from an initial aphere- 
sis pack containing approximately 3.6 X 10 11 platelets. The bulk of 
the losses occurred during the initial centrifugation and filtration 
steps. The purification protocol was less effective at removing 
erythrocytes, although there were less than 50 glycophorin-positive 
cells per J X JO 5 platelets after the final purification step. Nonethe- 
less, these cells represent unlikely sources for contaminating 
cellular RNA (see "Cellular microarray analysis" below). 

Cellular microarray analysis 

The purified platelet RNA was sufficient for microarray studies and 
was used for cRNA generation and hybridization to the Affymetrix 
HG-U95Av2 GeneChip. The anatomic profile of platelet RNAs 
from 3 healthy male donors was determined by using Asymetrix 
software. Of the J 2 599 probe sets imprinted onto the chip, a 
maximum of 2147 (17%) transcripts were computationally identi- 
fied as "present" by the Affymetrix software, 152 (1.2%) were 
equivocal, and nearly 82% were absent. As a fraction of the total 
genes present on the chip, the percentage of platelcl-expressed 
genes (I5%-J7%) was generally lower than that obtained from 
other human cell types in which 30% lo 50% of genes are present as 
determined by Affymeirix software (J. Schwedes, personal commu- 
nication. May 2002). The "limited number" of platelet- ex pressed 
transcripts presumably reflects the lack of ongoing gene transcrip- 
tion in the anuclcate platelet. Because less than 1% of circulating 
red blood cells contain residual RNA, it is unlikely that any of these 
transcripts are erythrocyte derived, although this was formally 
addressed by isolating total cellular RNA from 20 mL of whole 
blood (corresponding to an -3-Jog fold excess of erythrocytes than 
lhat identified in our final sample). The total cellular yield of RNA 
from this starting material was approximately 250 ng, suggesting 
lhat less than I ng erythrocyte- derived RNA was present in the 
purified plate lei preparations. Despite this, however both a- and 
3-globin transcripts— along with both the ferritin heavy and light 
chains — were identified as abundant transcripts (Table 2). Al- 
though i he most parsimonious explanation would be lesidual contami- 
nating reticulocytes, this is not supported by our erythrocyte contami- 
nation estimates, and iheir significance remains unresolved. 

As a means of better dissecting the molecular anatomy of the 
platelet, expressed genes were grouped on the basis of assigned 
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•Table 2. Top 50 human platelet-expressed genes 



Accession no. 


Gene symbol 


AD vafues. range* 


Gene transcriptf 


Leukocyte expression^ 


* Ml 7733 


TMSB4X 


140 142-307 852 


Thymosin 84 mRNA. complele cds 


+ 


X99076 


NRGN 


101 510-148279 


Neurogranin gene 




M25079 


HBB 


40 839-229 556 


p-gk)bin mRNA, complete cds 




M25915 


CLU 


84 720-140 246 


Complement cytolysts inhibilor (ctusterin) complete cds 


- 


JM755 


FTHP1 


82 960-148 621 


Ferrilin H processed pseudogene. complete cds 


- 


D78361 


OAZ1 ■ 


73 098-118 140 


mRNA for ornithine decarboxylase anlizyme 


- 


X04409 


GNAS 


77 761-94 781 


mRNA for coupling protein G{s) o-subunit (alpha-S1) 


_ 


. M25897 


PF4 


62811-126 908 


Platelet factor 4 mRNA. complele cds 


_ 


AB021286 


B2M 


61 689-108 921 


^-microglobulin 


+ 


X00351 


ACTB 


25143-73 775 


mRNA lor p-aclin 


_ 


D21261 


TAGLN2 


76 687-101 931 


mRNA for KIAA0120 gene 


+ 


AL031670 


FTLL1 


69 865-99 966 


Ferritin, light polypeptide 1 


■ • + 


U59632 


GPIIB 


41 404-110328 


Platelet glycoprotein IbB chain mRNA 




M21121 


CCL5 


47 308-106 399 


T-ceH* specific protein (RANTES) mRNA, complele cds 




X13710 


GPX1 


41 318*96 878 


Unspliced mRNA lor glutathione peroxidase 




J00153 


HBA1 


21 326-144 201 


Alpha globtn gene cluster on chromosome 16 


+ 


M22919 


MYL6 


46 337-106 833 


Nonmuscle/smooth muscle alkali myosin light chain gene 


+ . 


L20941 


FTH1 


52 787-74 763 


Ferritin heavy chain mRNA complete cds 




J03040 


SPARC 


D 1 lOw-/ H 4X> 1 


QPARf/fkctannctrtin mRklA rnmnlntn frit- 

or^MT\\,#/iJ5>ieuneciiri nirvPiM, complete cus 






GNAS 




GSA mRNA for q subunil of GsGTP binding protein 




X58536 


HLA 


31 183-82 613 


mRNA for major HLA class 1 locus C heavy chain 




M54995 


PP8P 




Connective tissue activation peptide til mRNA 




U34995 


GAPD 


35 095-70 250 


Normal keratinocyte substraction library mRNA clone H22a 




1.40393 


MLM3 


"V> 1fi7-7 J * IRA 


Clone zapl 12 (mulL protein homolog 3) mRNA 








*M A*Si (It fYitt 


curtfs lor rtrvj \ric i proio-oncogene Ht i/ki ujj 




U9055 1 




JD i/OD-O 1 OZfZ 


Uietnnn "DAJiLn nmlain /UITA/IV mDKIA 

msrone ^i/wine pruiein ^nz>V t j iDt\r4/\ 




M1 1353 


H3F3A 




rto.o nisione ciass rnr\r»/\ 




212962 


RPL41 


36 003-54 853 


mRNA for homologue to yeast ribosomal protein L4 1 






TUBA1 


9fl Qflft.RI 7QA 


MAI PMA AA rtarta far 1H1 lltrt 

nMLrrw n gene jor a-iuDuiin 




AB028950 


TLN1 


9d ^t.-ift K1 1 

4*4 ijr l-;jO Ol 1 






Y127 1 1 


PGRMC1 




mRNA for putative progesterone binding protein 




M16279 


Ml 02 


in RQA AR 1Kfl 


Integrated membrane protein (MIC2) mRNA 




D78577 


YWHAM 


. 24 785-50 437 


Drain 14-3*3 protein 0-chain 




AF070585 


TOP38 




Clone 24675. unknown cDNA 




AA524802 


Unknown 




rflNA IMAflF rlrmo Q^A^il 

V^UIVM, l(Vl>\OC CIO* IK? U^J**£ I O 




AD009010 


UBC 


28 745-38 389 


mRNA tor polyubiquitin UbC 


+ 


X 57985 


H2AFO 


2! 678-52 108 


Genes lof histones H2B. 1 and H2A 




X54304 


MLCB 


25 733-34 109 


mRNA lor myosin regulatory light chain 




M 1453 9 


F13A1 


23 691-48 474 


Factor XMI subunil »• polypeptide mRNA, 3* end 




AI540958 


Unknown 


* 24 872-41 118 


. cDNA. PEC T.2_15_HOI.i 5' end/cton 




AL050396 


FLNA 


13 634-55 235 


cDNADKF2p 586K1720 




X56841 


HLA-E 


12 890-49 327 


Nonctossical MHC class 1 antigen gene 




M26252 


PKM2 


15 450-47 786 


TCB {cytosolic thyroid hormone- binding protein) 




M 14630 . 


P7MA 


19 314-45 088 


Prothymosin alphamRNA 




AF 045229 


RGS10 


19156-34 243 


Regulator ol G protein signaling 10 mRNA 




AA477898 


Unknown 


16 863-44 756 


cDNA. Z&34I08.M 5' end 




X95404 


FL1 


15 216-37 456 


mRNA lor nonmuscle type cofilrn 




M34480 


ITGA2B 


8 627-45 495 


Platelet glycoprotein lib (GPIIb) mRNA 




283730 


H2BFE 


18 001-31 306 


HH2B/e gene 




t 19779 


• H2AFO 


17 319-38 951 


Histone H2A.2 mRNA, complele cds 





*Gene expression Quantifications were calculated as the average difference (AD) value (matched versus mismatched oligonucleotides) lor each probe set using Atfymelrix 
GewChip soltwate. version 4.01. The range of values Ifom 3 disiincl platelet rnicroarrays is shown: the normalization value lor all mrcroanay analyses was 250. 

pranscrtpis 3t e rank- ordered (highest to lowest) using BR B- Array Tools software by log- intensities ol AD values obtained Irorn 3 different healthy donors; 33 ol the lop 40 
transcripts were lisled among the lop 50 in all 3 mtcroarr ay sets. 

{Leukocyte e>pression was detennined by microarray analysis using purified peripheral blood leukocytes, followed by construction ol rank- intensity plots for comparison to 
platelet lop 50 transcripts. 70 lop leukocyte-derived transcripts identified within the ranked top 50 platelet transcripts are depicted by a H ) present, or (- ) absent. 

cds indicates coding sequence. 



gene annotations, and this analysis was used to provide a pan- 
oramic tlel'miiion ol* the plalelel iranscriptonicvOf ihe »encs that 
could be cataloged within assigned "clusters."' those involved in 
metabolism (11%) and receptor/signaling (11%) represented the 
largest groups. Also evident in these analyses is the relatively large 
percentage of genes involved in functions unrelated to these key 
groups (ie. miscellaneous. 25%), and the ovenepresentation of 
genes with unknown function (32%) as annotated by Aftyinctrix 



and RcfSeq dalabascs. ? ' These results identify a vast array (nearly 
one half) of platelet genes (and gene products) that presumably 
have important, but poorly characierized functions, in platelet 
and/or megakaryocyte biology. 

Although microarray analysis is not truly quantitative, rank- 
ordering using the mean log- inlensi ties from 3 independent microar- 
rny analyses allowed for the categorization of the lop platelet 
transcripts (Table 2). Computational analyses demonstrated thai 
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only 1 0 of the top 1 00 genes were discordant among the 3 platelet 
microarrays, although 71 of 100 genes were discordant between 
platelet and leukocyte arrays. An inventory of the top 50 platelet 
genes is listed in Table 2, which also delineates those found to be 
highly expressed in peripheral blood leukocytes by parallel microar- 
ray experiments with this purified cellular fraction (data not 
shown). Further analysis of these cell subsets demonstrated that 
approximately 25% (n = 547) of the total platelet transcripts were 
platelet restricted. Furthermore - only 10 of the 50 most highly 
expressed genes were found to overlap, -confirming the distinct 
cellular profiles of each iranscriptome. Of the 12 overlap genes, 3 
corresponded to globin or ferritin chains (again suggesting the 
presence of contaminating reticulocytes in both purified fractions), 
and another 4 were involved in act in cytoskeletal reorganization 
and human leukocyte antigen (HLA) expression, gene products that 
. regulate critical functions in both cell types. Given the importance 
of cytoskeletal reorganization in downstream platelet activation 
events, it is not unexpected that components of the actin machinery 
system would demonstrate prominent transcript expression. Previ- 
ous estimates suggest that 20% to 30% of the total platelet 
proteome is comprised of actin with other components such as 
actin-binding protein, mysosin, and talin accounting for an addi- 
tional 2% to 5% of the total protein.'- 22 The mRNAs encoding the 
actin-related machinery ore overrepresented in our microarray 
analysis, with 8 such transcripts found among the 50 highest 
platelet-expressed genes. Interestingly thymosin 04 demonstrated 
the highesi expression pattern. In unstimulated platelets, 30% to 
40% of actin is polymerized as F-actin, 77 whereas the balance of 
actin monomers (G-aclin) are polymerization inhibited by seques- 
tering proteins such as profilin (100 pM) and thymosin 04 (600 
pM)." The high thymosin (34 iranscript expression not only 
correlates with its known abundance in platelets but also supports 
the importance of actin inhibitory proteins in maintaining the 
nonstimulated state of circulating platelets. 

Platelet SAGE analyses 

Although these initial studies identified the distribution and relative 
expression patterns of the genes within the Asymetrix data set, 
they do not allow for analyses of genes that arc unrepresented by 
these oligonucleotide chips. Unlike closed microarray profiling 
strategies, SAGE is an open architectural system that is ideally 
suited for novel gene and pathway identification. Accordingly, the 
platelet RNA used for microarray studies was used for platelet 
SAGE. A total of 2033 tags were initially cataloged, of which J 800 
(89%) corresponded to mitochondrial -derived genes. These results 
were quite different from those obtained by microarray analyses, 
but the discrepancy can be resolved by the nonrepresentation of the 
mitochondrial genome on the gene chip. The mitochondrial ge- 
nome. is a compact approximately 16.6-kiIobase (kb) sequence 
encoding 13 genes and 2 ribosomal subunits.- 4 Primary mitochon- 
drial transcripts are polycislronic and typically contain premature 
termination or unpredictable splice sites, result ing in multiple 
polyadenylated transcripts from individual genes. ?4 - ? - s Indeed, the 
' overall distribution of platelet- derived mitochondrial SAGE tags is 
quite similar to that found in muscle." All 13 genes containing 
Mid J J sites were detected, whereas neither of the non-AValll- 
containing genes were identified (nicotinamide adenine dinuclco- 
lide |NAD>I] dehydrogenase subunit 4L and adenosine triphos- 
phatase (ATPase] 8). Most of the tags were from the I6S and 12S 
ribosomal RNAs — which collectively accounted for 68% of the 
total mitochondrial lags — with the fewest taus represented by 
NADH dehydrogenase subiwits 3.5. (». and cytochrome c oxidase 1 



BLOOD, 15 MARCH 2003 -VOLUME 101, NUMBER 6 




AlFxtt NADH41 NAWtt 



Figure 3. Schema of the mitochondrial genome with SAGE tag distributions 
(only tags with identical matches are displayed). The abundance of the SAGE 
tags (n = 1800) at individual N/attl sites (arrows) within the mitochondria! fteavy 
strand is shown on the bottom, whereas those tags corresponding to the mitochon- 
dria* light strand are delineated above the anows (the presence of an unaccompanied 
arrow implies no SAGE tags at that Mot II site). The gene products of mt-DNA 
(RefSeq accession no. NC_O018O7) are delineated by the open rectangles, whereas 
. stippted boxes represent tRMA genes and control regions (the single tag represented 
by the [') refers to mitochondriat transfer RNA-serine). Note that NADHB is encoded 
by the fight strand and that there are no Mailt sites within the ATPaseB gene segment. 
COfn], cytochrome c oxidase subunit; Cyt b, cytochrome b. 

(Figure 3). The NADH dehydrogenase subunit 6 RNA is the only 
mRNA encoded by the light (L) strand of mitochondrial DNA and 
was the least abundantly detected transcript. 

The unusually high preponderance of mitochondrial- derived 
genes is not inconsistent with the known enrichment of these 
genomes in human platelets, 1 - 34 and presumably reflects persistent 
transcription from the mitochondrial (mt) genome in the absence of 
nuclear-derived transcripts. This overrepresentation of mtDNA in 
platelets is considerably greater than that of its closest cell type 
(skeletal muscle), in which mt genomes represent approximately 
20% to 25% of all SAGE tags. 25 Interestingly, the energy metabo- 
lism of platelets is not dissimilar from that of skeletal muscle, both 
cell types actively using glycolysis and large amounts of glycogen 
for ATP generation. 26 Like muscle, platelets are metabolicaJly 
adapted to rapidly expend large amounts of energy required for 
aggregation, grannie release, and clot retraction. Similar to the 
situation in all eukaryotic cells, platelet mitochondria represent the 
primary source of ATP, which is generated from oxidative phosphor- 
ylation reactions occurring within these organelles. Mitochondria 
are also responsible for most of the toxic reactive oxygen species 
generated as by-products of oxidative phosphorylation and are 
central regulators of the apoptotic process in other cellular types. 
The mtDNA encodes polypeptides found within 4 of the 5 
multifunctional complexes that regulate oxidative phosphorylation 
within the platelet mitochondria. 27 Whether the continued genera- 
tion of these polypeptides has a role in platelet energy metabolism 
and/or the apoptotic mechanisms regulating platelet survival re- 
mains speculative, although not inconsistent with our observations. 

Comparative analysis of SAGE and microarray 
transcript abundance 

Complete SAGE libraries require the sequencing of up to 30 000 
tags for an exhaustive cataloging of individual mRNAs, especially 
those wiih limited copy numbers. Given the preponderance of 
mt- derived transcripts, comparable sampling would have required 
sequence analysis of nearly 300 000 SAGE tags, an inordinate 
number for comprehensive analysis of the platelet iranscriptome. 
For platelets, alternative methodologies incorporating subtract ive 
SAGE will be required for more comprehensive transcript profil- 
ing.^ Our initial sampling of nonmitochondrial genes remains 
informative, however, and entirely consistent with the results of 
platelet microarray studies. As shown in Table 3, SAGE tags for the 
genes encoding thymosin {M, f*2- microglobulin, neurogranin. and 
the platelet glycoprotein Ibp polypeptide were among the most 
frequently identified platelet genes, similar to the rank-ordered 
results determined by microarray analysis. To formally confirm the 
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Table 3. SAGE-identified nonmitochondrial lags 








Frequency 


CATG + SAGE lags* 


Accession no t 


Gene 


Microarray! 


26 


GTTGT GGTTAATCTGGT 


mm nndCidA i 


02-miaoglobuJin (B2M), mRNA 


PPP 


21 


TTGGTGAAGGAAGAAGT 


NM_021 109.1 


Thymosin B4 ; X chromosome (TMSB4X). mRNA 


P 


8 


AGCTCCGCAGCCAGGTC 


NM_ 002620.1 


Platelet factor 4 variant 1 (PF4VI). mRNA ' 


P 


8 


AGCTCCGCAGCCGGGTT 


NM_002619.1 


. Platelet factor 4 (PF4), mRNA 


P 


7 


TGTA7AAAGACAACCTC 


NM_ 002704.1 


Proptatetet basic protein {p-thromboglobufin) 


Pp 


5 


GGGCACAATGCGGTCCA 


NM.0D0407.1 


Glycoprotein Ib0 polypeptide, mRNA 


P 


3 


AGGTAATAAAAGGTAAT 


NM_ 00351 2.1 


H2A histone family, member L (H2AFL), mRNA 


P 


3 


AGTGGCAAGTAAATGGC 


NM_021914.2 


Cofilin 2 (muscle) (CFL2), mRNA 


N/A 


,3 


TGACTGTGCTGGGTTGG 


NM_006 176.1 


Neurogranin {protein kinase C substrate, RC3) mRNA 


P 


3 


TTGGGGTTTCCTTTACC 


NM_ 002032.1 


. Ferritin, heavy polypeptide 1 (FTH1), mRNA 


P 


2 


CCCTTGTGACTACCTAT 


NM.025158.1 


Hypothetical protein FU22251 (FLJ22251), mRNA 


N/A 


2 


CCTGTAACCCCAGCTAC 


NM_032779.1 


Hypothetical protein FU14397 (FIJI 4397), mRNA 


N/A 


2 


CTTGTAGTCCCAGCTAC 


NM.017962.1 


Hypothetical protein FU20825 (FLJ20825), mRNA 


N/A 



"Unique tags identified more than once. . 

t Refers to the RefSeq accession no. ?l Note that this number does no! necessarily correspond to the accession no. provided by Affymetrix software annotations (Table 1). 

^Presence (P) or absence (A) is based oh results from 3 dislincl platelet microanay experiments. Capitalized "P.* designates a gene that is in the top 50 on all 3 microarray 
experiments, whereas small "p* designates those transcripts not in the top 50. Two of the genes (p2-miqoglobuBn and fMhromboglobufinJ.are represented by 3 and 2 probe 
sets, respectively, on the HGU95Av2 gene chip; for p2-M, all 3 probe sets were in the top 50 genes, whereas for thymosin 04 1 of 2 was in the top 50 for all experiments (the 
other probe set was in the top 75 tor all experiments). N/A indicates oligonucleotide not present on Affymetrix HG-U95Av2 gene chip. 



results independently obtained by SAGE and microarray analysis, 
quantitative RT-PCR was completed by using oligonucleotide 
primers specific for 2 abundant mitochondrial transcripts, )6S 
rRNA and NADH2 thymosin p4 (high-abundance by microarray 
and SAGE), 2 incompletely characterized high-abundance tran- 
scripts (neurogranin and clustcrin; see "Protein immunoanalysis of 
platelet clusterin and neurogranin"), a low-abundant transcript 
(T-cell receptor ^-polypeptide), and the genes encoding proteins 
with well-established quantitative determinations (ie, glycoprotein 
a tin3j [—50 000 receptors/platelet]; protease-activated recepfor-1 
(PARI) {— 1800 receptors/platelel]).' As shown in Figure 4, these 
analyses reveal excellent concordance between SAGE and microar- 
ray studies, demonstrating the predominant frequency of the 
mitochondrial-derived 16S rRNA/NADH2 transcripts,, with incre- 
mentally lower expression of other transcripts as initially demon- 
strated by microarray (16S > NADH2 > thymosin (34 > neurogra- 
nin > clustcrin > o IIB p, > PAR 1 > TCR£). 

Given the small number of nonmitochondrial SAGE tags 
available for analysis (n = 233), limited conclusions can be drawn 
using traditional (nonsubtraction) platelet SAGE libraries as pre- 




Figure 4. Ouantilalive real-lime RT-PCR analysis of platelet transcripts. Real- 
lime RT-PCR was completed b> using purified plalelel RNA and oligonucleotide 
primer pairs specifically designed using Prime»3 software to generate similarly-sized 
(-?00-bp) PCR products, optimized to Ihe some annealing temperature. In graph, 
fO) represents Mb. (») represents Hta, (i) represents PARI, (a) represents 16S 
rRNA. (T) represents NADU?. (▼) represents thymosin, (O) represents clusterin, (♦) 
represents neurogianin. and (•) represents TCRp. Curves are representative ol one 
complete set ol experiments (lepeated twice), and line plots reftecl average 
determinations I'om 3 wells performed in parallel with SEW (ess than 1% t U r al! 
data points. 



sented here. Overall, a total of 126 unique tags were identified, the 
majority of which (94) were represented only once. Of the total 
unique tags, nearly one half represented novel genes not present on 
the Affymetrix U95Av2 GeneChip. Of the genes with unique tags 
identified more than once, there was excellent concordance with 
microarray expression analysis, with nearly alt of the SAGE tags in 
Table 3 corresponding to platelet top 75 microarray transcripts. The 
platelet factor (PF) 4 variant represents a single aberration because 
this was rank-ordered approximately 350 by microarray, although 
its SAGE tag frequency was identical to that of the predominant 
PF4 transcript. The lack of extensive nonmitochondrial SAGE 
sampling precludes any further extrapolations from this, apparent 
aberration. Of note, a subset of these tags had long poly(A) tracts; 
although they all corresponded to genes identified in the RefSeq 
database. 7 ' We cannot exclude the possibility of a SAGE artifact 
for this small subset of tags (~2%, representing 46 of 2033 tags), 
although the authenticity of the vast majority of tags (—98%) 
clearly validates the methodology. These tags are mosi likely 
explained by the unique biology of the platelet (ie, mRNA decay in 
the absence of de novo transcription) or to mRNA degradation 
occurring during the extensive purification methods. In summary, 
even with a remarkably limited sampling, the power of this 
approach in gene identification of relatively abundant and less 
abundant transcripts is evident. It is clear, however, given the 
unique molecular anatomy of the platelet (ie. abundance of 
mitochondrial transcripts), that SAGE adaptations will be required 
for more comprehensive genetic profiling. 79 

Protein immunoanalysis ol platelet clusterin and neurogranin 

Although most of ihe "most abundant" transcripts would conform 
to a priori predictions for platelet-expressed mRNAs, a number of 
transcripts were identified that had been poorly characterized in 
human platelets. To further establish the authenticity of highly 
expressed transcripts such as clusterin and neurogranin, confirma- 
tory protein analyses were completed. As shown in Figure 5, both 
proteins were clearly detected in purified platelel lysates: further- 
more, their cellular platelet . distributions conformed to those 
predicted based on previously proposed functions. Note for ex- 
ample that clusterin — functionally characterized as a complement 
lysis inhibitor able to block the terminal complement reaction — is 
primarily expressed on the extracellular platelet membrane- 3 " 
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Figure 5. Immunoanarysis of ptotetet neurogranin and 
clusterin. (A-D) Get-filtered ptatetets were eiihof ffaced In 
37% fwrnaldehyde (nonpemioabilized) or fixed with porrno- 
atfction in the presence of 0.1% TrforiOC toltowed by flow 
cytometric analysis using srrtidustem, anti-lfb/nb. or anSneu- 
rogranin anUbodies and the FITC-conjugaled spedes- 
spectfic secondary antibody (in C, (he FITOconpjgated 
arttirabbit and antimouse controls are esseritialry superim- 
posed). (E-F) Ten myograms of sofubifced HepG2 cete 
(hepatocyle cell line), human brain, or purified platelet rysates 
were analyzed by SDS-PAGE,^ 7 . end immunoblot analysis 
were completed by using lilOOOdl^ons of either antjneuro- 
granin SDS-PAGE) or antWusterin (8% SDS-PAGE) 
antibodies. The arrticJustonn antibody recognized 2 platelet 
imrnunorBactive species under, shorter exposure. Although 
Ihe relative neuogranin and dusterin protein abundances 
are subopfimaJry quantified by those, araJysos, platelet ctus- 
lerin appears to demonstrate considerable expression when 
compared with that previously identified in hepatocytes. 31 



Given the importance of complement activation in platelet destruc- 
tion, the 'prominent expression of cell-surface clusterin might 
suggest a role for this protein in normal and pathologic events 
regulating platelet survival. Interestingly, a clusterin-deficient 
knockout mouse has been generated that demonstrates enhanced 
cardiac dysfunction in a model of auloimrmme myocarditis. 31 
Although these mice apparently have normal baseline hemograms 
(B. Aronow, personal communication, October 2002), it remains 
unesiablished if they would be predisposed to immune- type 
thrombocytopenia in systemic models of autoimmunity. 

Similarly, the gene encoding an intracellular effector protein 
thai may have key roles in downstream platelet activation events 
has now been demonstrated to have abundant transcript expression 
in human platelets. Neurogranin is a highly expressed platelel 
transcript with its gene producl demonstrating a primarily intracel- 
lular pattern of distribution. Neurogranin is generally described as a 
bra in- specific. Ca ?4 -sensitive calmodulin- binding phosphoprotein 
lhat is preferentially expressed in neuronal cell bodies and den- 
drites. 33 - 13 ii is a specific protein kinase C (PKC) substrate that can 
also be modified by nitric oxide and other oxidants to form 
intramolecular disulfide bonds. Both its phosphorylation and 
oxidation state attenuate its binding affinity for calmodulin. 33 In 
stimulated platelets. PKC generation is linked to various activation 
pathways such as calcium-regulated kinases, milogen-activated 
protein (MAP) kinases, and receptor tyrosine kinases. 1 Thus, these 
observations suggest thai platelet neurogranin may function as a 
previously unidentified component of a PKC-dependcnt activation 
pathway coupled to one (or more) of ihese effector proteins. 



Discussion 

These data provide documentation for a unique plniclet mRNA 
profile "ihal may provide a tool for analyzing platelet molecular 
networks. Nonetheless, the molecular analysis of the platelet 
Uanscriptome may be confounded by the constant decay of 
mRNAs in the absence of new gene transcription, a situation that 
may. for example, limit the identificaiion of low-abundance 
transcripts. Similarly, because Ihe circnlaiing platelel pool contains 



a mixed population of variably aged platelets, a "static" mRNA 
profile represents an average of this heterogeneous blood pool. 
Despite these potential limitations, the corhbination of genomic 
and proteomic technologies are lifcely to provide powerful tools for 
the global analysis of platelet function. Current strategies for 
cataloging "whole cellular proteomes" are generally accomplished 
by using 2 developing methodologies:,( J ) high resolution 2-dimen- 
sional polyacrylamide gel electrophoresis (2-DE) with mass spec- 
tromerric sequence identification, 34 and (2) microcapillary liquid 
chromatography with tandem mass spectrometry (uXC-MC/ 
MQ. 35 Further modifications of both procedures have been devised 
for direct comparative studies between 2 cellular proteomes. The 
introduction of 2-DE differential gel electrophoresis has now made 
il possible to detect and quantify differences between experimental 
sample pairs resolved on the same 2-dimensional gel. 36 Likewise, 
Ihe application of isotope-coded affinity tags to uXC-MC/MC 
represent a novel means of quantitative analyses between cellular 
proteomes. 37 The success of both approaches relies on the availabil- 
ity of comprehensive genomic databases and mathematical algo- 
rithms for optimal protein identification. Indeed, mathematical 
modeling studies have demonstraied the need lo delineate both 
protein and mRNA expression levels for optimal definition of 
intracellular networks. 38 Our data presenl an initial framework for 
delineating platelet function by defining the molecular anatomy of 
human platelets, information that is likely to provide important 
clues into the dynamic protein interactions regulating normal and 
pathologic platelet functions. Furthermore, because Ihe platelet 
transcriptome mirrors Ihe mRNAs derived from precursor 
megakaryocytes, Ihese analyses may provide insights into the 
biochemical and molecular events regulating megakaryocytopoi- 
esis and/or proplalelel formation. 
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Expression level of Ubc9 protein in rat tissues. 
Gotcbiowski P, Szulc A . Sakowicz M Szutowicz A . Pawclczvk T 

Department of Molecular Medicine, Medical University of Gdansk, 80-2 1 1 Gdansk 
Poland. ' ' 

Ubc9 is a homologue of the E2 ubiquitin conjugating enzyme and participates in the 
cova ent hnking of SUMO- 1 molecule to the target protein. In this report we describe a 
simple and efficient method for obtaining pure human recombinant Ubc9 protein The 
purified Ubc9 retained its native structure and was fully active in an in vitro sumoylation 
assay with the promyelocyte leukaemia (PML) peptide as a substrate. In order to better 
understand die physiology of Ubc9 protein we examined its levels in several rat tissues 
Immunoblot analyses performed on tissue extracts revealed quantitative and qualitative 
differences in the expression pattern of Ubc9. The Ubc9 protein was present at a high 
level m spleen and lung. Moderate level of Ubc9 was detected in kidney and liver Low 
amount of Ubc9 was observed in brain, whereas the 18 kDa band of Ubc9 was barely 
visible or absent in heart and skeletal muscle. In heart and muscle extracts the Ubc9 
antibodies recognized a 38 kDa protein band. This band was not visible in extracts of 
other rat tissues. A comparison of the relative levels of Ubc9 mRNA and protein 
indicated that the overall expression level of Ubc9 was the highest in spleen and lung In 
spleen, lung, kidney, brain, liver and heart there was a good correlation between the 18 
kDa protein and Ubc9 mRNA levels. In skeletal muscle the Ubc9 mRNA level was 
unproporhonally high comparing to the level of the 18 kDa protein. The presented data 
indicate that m the rat the expression of the Ubc9 protein appears to have some degree of 
tissue specificity. 

PMID: 14739995 [PubMed - indexed for MEDLINE] 
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Protein abundancy and mRNA levels of (lie adipocytc-type fatty acid 
binding proteia correlate in non-invasive and invasive bladder transitional 
cell carcinomas. 

Gromova I. Gromov P . Wolf H . Celis JE . 

Department ofMedical Biochemistry and Danish Centre for Human Genome Research 
The University of Aarhus, Aarhus C, Denmark. 

The adipocyte type fatty acid-binding protein (A-FABP) is a small molecular weight fatty 
acid-binding protein whose expression correlates both with the grade of atypia and the 
stage of bladder transitional cell carcinomas (TCCs). To determine if the protein 
abundancy correlates with the mRNA levels in non-invasive and invasive lesions we 
have analysed fresh TCCs (grade II, Ta; grade III, T2-4) by two-dimensional ' 
polyacrylamide gel electrophoresis (2D-PAGE) and measured the mRNA levels using the 
reverse transcription linked polymerase chain reaction (RT-PCR). Overall, the results 
showed a good correlation between protein abundancy and mRNA levels, 'indicating that 
the lack of expression of the protein observed in some lesions reflects low levels of 
transcription of the A-FABP gene rather than translational regulation. In addition our 
studies showed that the loss of A-FABP protein observed in some tumors is not ' 
compensated by an increase in the skin fatty acid-binding protein P A-FABP as is the 
case in the A-FABP knockout mice. 

PMDD: 9664136 [PubMed - indexed for MEDLINE] 
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Abstract. The adipocyte type fatty acid-binding protein (A- 
FABP) is a small molecular weight fatty acid-binding protein 
whose expression correlates both with the grade of atypia and 
the stage of bladder transitional cell carcinomas (TCCs). To 
determine if the protein abundancy correlates with the 
mRNA levels in non-invasive and invasive lesions, we have 
analysed fresh TCCs (grade II, Ta; grade 111. T 2 . 4 ) by two- 
dimensional polyacrylamide gel electrophoresis (2D-PAGE) 
and measured the mRNA levels using the reverse transcription 
linked polymerase chain reaction (RT-PCR). Over all > the 
results showed a good correlation between protein abundancy 
. and mRNA levels, indicating that the lack of expression of 
the protein observed in some, lesions reflects low levels of 
transcription of the A-FABP gene rather than translaiional 
regulation. In addition, our studies showed thai the loss of A- 
FABP protein observed in some tumors is not compensated 
by an increase in the skin fatty acid-binding protein PA- 
FABP, as is the case in the A-FABP knockout mice. 

Introduction 

Bladder cancer accounts for 4.7% of all human cancers 
diagnosed. The spectrum of bladder tumors is broad with 
various histological types that include transitional cell 
carcinomas (TCCs), squamous cell carcinomas (SCCs), and 
adenocarcinomas (1-3). TCCs are by far the most prevalent 
tumors as they represent nearly 95% of all bladder cancers in 
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the Western Hemisphere. At first presentation, about 70% of 
the urinary bladder TCCs are diagnosed as differentiated 
superficial lesions that are confined either to the mucosa 
(Ta), or to the underlying connective tissue (Tj). The rest 
correspond to highly invasive, poorly differentiated tumors. 

Non-invasive TCCs occur as two distinct growth patterns, 
papillary and non-papillary (flat) lesions (1,2); that display 
significant differences in their maJignanf potential and that 
are believed to originate from different genetic alterations 
(4-6). Papillary carcinomas usually correspond to low-grade 
lesions which frequently recur multiple times. These tumors 
begin as areas of hyperplasia that later undergo a process of 
dedifferentiation (grades 1-1V). Invasive tumors may arise 
Trorn these lesions, but often are derived from non-papillary 
carcinoma in situ that usually is of high grade at presentation 
and tend to invade and progress to muscle invasion and 
metastatic disease. 

To date, many attempts have been made to pinpoint genetic 
changes that underly progression of bladder cancer as well as 
to identify molecular markers that correlate with tumor 
progression. Cytogenetic studies and molecular geneuc data 
have shown that chromosomes 3p, 4p, 4q, 5q. 8p, 9p, 9q. 
lip I3q, 14q, I7p and I8q are frequently altered in bladder 
urothelial tumors (4,5 and refs. therein), and as a whole they 
have supported the notion thai bladder cancer is a multistep 
process. Recently, Spruck et ot (6) showed that chromosome 
9 alterations occur early during development, while p:>3 
mutations appear later in the process and confer invasive 
properties. The situation however is reverse in the case ot 
Cis, as a large fraction of these lesions contain p53 mutations 
(5 6 8,9). Besides pointing towards two divergent pathways of 
bladder tumor progression, these studies implied that the order 
in which the genetic changes occur is important in determining 
the outcome of the lesion. 

Assessment of bladder cancer is based on a ihorougn 
pathological examination of biopsy material which establishes 
the histological type of. the tumor, its degree of differentiation 
(orade), and depth of invasion of the bladder wall (staging) 
( .0-12) In spile of strict criteria for the pathological assessment 
of ihese lesions, (here exist a significant inter-pathologist 
variation, a fact that emphasises the need for objective 
markers that may aid their classification. With this in mind 
we are exploring the possibility of using proteome (13) 
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expression profiles of these lesions as fingerprints to define 
their grade of aiypia and eventually their stage (3 J 4). So far. 
more than 400 tumors of various grades and stages have been 
analysed by two-dimensional potyacrylamide gel 
electrophoresis (2D PAGE), and preliminary experiments 
have shown that even though the overall protein expression 
profiles of tumors of the same grade and stage are very similar, 
there are important differences suggesting that morpho- 
logically 'identical' TCCs may be further subdivided (1). Of 
the biomarkers of TCC progression identified so far, the 
adipocyte-type fatty acid binding protein (A-FABP) is 
perhaps one of the most interesting as the levels of this 
polypeptide have been shown to correlate both with the grade 
of aiypia as well as with the stage of the disease (3). Given 
the putative importance of A-FABP as a progression marker, 
and since Anderson and SeiJheimer (15) recently showed that 
post-transcriptional regulation of gene expression is a frequent 
phenomena in higher organisms, we have compared the levels 
of A-FABP mRNA and protein in non-invasive and invasive 
bladder TCCs expressing and lacking this protein. 

Materials and methods 

Tumors. Fresh bladder tumors were obtained immediately 
after transurethral resection. The grade and clinical stage of 
the tumors were determined by the pathologist at the Aarhus 
Municipal hospital. Clean tumors devoid of blood clots were 
divided into small pieces for 2D PAGE and DNA, and RNA 
preparation. The latter were immediately frozen in liquid 
nitrogen and store at -80"C until use. 

l^SJ-methionine labeling and 2D-PAGE. In a few cases, 
small tumor pieces were labeled with [ 15 S]-methionine as 
previously described (3). 2D-PAGE was performed 
according to published procedures (16; see also hltp: 
//biobase.dk/cgi-bin/celis). 

RT-PCR. Frozen tumor samples were ground to powder in. 
liquid nitrogen and tola) RNA was isolated using the acid 
guanidium isothiocyanate/phenol chloroform extraction 
procedure (17). The samples were treated with RNase-free 
DNases I (Pharmacia) to eliminate contaminating genomic 
DNA using the protocols recommended by the supplier. 
Poly(A)* RNA was prepared using Poly (A)' Quick columns 
according to the manufacturers instructions (Stratagene). The 
synthesis of cDNA for RT-PCR reactions were carried out 
using the Gibco BRL Superscript Kit. Two pg of toial RNAs 
was mixed with oligo -dTl I primer, PCR buffer, MgCV 
(25 mM), 0.1 M DDT and 10 mM dNTP. The mixture was 
incubated at 42'C for 5 min followed by the addition of 
Superscript II reverse transcriptase and further incubation at 
42'C for 50 min. The reaction was terminated by raising the 
temperature to 70"C for 15 min, followed by additional 
incubation at 37*C for 20 min in the presence of RNase H to 
deplete the RNA. Primers for known and cloned genes were 
purchase from Pharmacia as follows: A-FABP, Upper (from 
186-208 bp) 5* GATCATCAGTGTGAATGGGGAT-37 
lower (from 374-397 bp) 5'- CATCCTCTCGTTTTCTCTiT 
ATG-3'; B-actin upper 5'-GAGGTTGGCrCTGACTGTACC 
AC-371ower 5-CTCATTCAGCTCTCGGAACATCTCG-3'. 



Table I. Expression of A-FABP in non-invasive and invasive 
bladder TCCs: 
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■The levels of a A-FABP protein were determined based on the 
visual analysis of Coomassie Brilliant Blue stained gels and 
represent the average estimate of at least two different runs. Tumors 
scored as positive differed significantly with respect to the levels of 
ihe protein, and therefore are indicated with either four (very high), 
three (High), two (medium) and one (low) cross (see also Fig. 1); 
The mRNA levels were determined based on the intensity of 
Ethidium Bromide stained cDNA bands separated on agarose gel 
using the Bio-Rad Gel Doc 1000 system and represent the average 
estimate of at least three independent experiments. Corresponding 
mRNA levels are represented by crosses as described above. 



PCR reactions were performed in a Progene thermal cycler 
using the Advantage Klen Tag Polimerase (Clontech). The 
cycling parameters consisted of 30 sec of denaturation at 
94"C, with annealing of 30 sec at 60*C for 8-actin or at 
64*C for A-FABP. The extension time was for 2 min at 68"C 
for 29-40 cycles with the final extension of 7 min at 68X. The 
PCR products were separated on 1.5% agarose gel 
electrophoresis followed by ethidium bromide staining and 
photography under UV light. 

Results 

A-FABP proiein levels in non-invasive and invasive TCCs. 
One hundred suspected TCCs removed at the Department of 
Urology, Skejby Hospital, were analysed by high resolution 
2D PAGE and Coomassie Brilliant Blue staining. Of these, 
10 grade II. Ta TCCs (Table 1) were chosen to correlate A- 
FABP proiein and mRNA levels as these lesions yielded 
acceptable protein profiles both sn terms of their purity as 
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FiPurc 1 1EF 2D eels of whole cellular extracts from non-invasive and 
and a TCC 711-1- Only ihe relevant area of the gels are shown. 




assessed by monitoring for the absence of vimentin 
(contamination with connective tissue) and desmin 
(contamination with smooth muscle cells), as well as 
polypeptide resolution. In addition, reasonable amounts of 
these tumors were available for mRNA preparation. 

Table I shows the levels of A-FABP protein expression in 
the 10 tumors analysed by 2D PAGE. The data were scored 
entirely based on the visual analysis of Coomassie Brilliant 
Blue stained gels and represent an average estimate of at 
least two different runs. Tumors scored as positive differed 
significantly with respect to the levels of this protem, and 
therefore are indicated with either four (very high), three 
(high), two (medium) and one cross (low). Representative 
examples of Coomassie stained 2D gels of tumors exhibiting 
very high (TCC 532-1, Fig. I A), high (TCC 692-1; Fig. IB), 
medium (TCC 763-1. Fig. 1C) and undetectable levels (TCC 
709-1 and TCC 711-1 Fig. 1D-E) of A-FABP are shown in 
Fig. 1 (only the relevant area of the gels are shown). 

A-FABP mRNA levels in non invasive grade iJ t To TCCs. 
Since in many instances only a limited amount of fresh tumor 
was available, we used RT-PCR to determine the levels of A- 
FABP mRNA in the ten TCCs analysed by 2D PACE (Fig. 1). 
Following amplification, ihe PCR products were analysed by 
conventional 1.5% agarose gel electrophoresis and visualised 
by ethidium bromide staining as shown in Fig. 2. The amount 



Fifiurc 2 RT-PCR analysis of A-FABP mRNA expression in non-invasive 
(GrlK Ta) and invasive TCCs (Or 111. T r TJ. For RT-PCR analysis, the ss c- 
DNA was synthesized by Reverse Transcriptase using iota! RNA, and used 
Tor RT-PCR amplification. The PCR products were resolved on 1:5% 
agarose gels and visualised under UV light following ethidium bromide 
staining The A-FABP panels show the results of amplifications where the 
pair of gene specific primers was used to generate ihe 220 bp DNA 
fragment. Amplification of A-FABP was obtained after 30 cycles of PCR^ 
The B-actin panels represent Ihe amplification of the B~actm gene, whicb 
was used as an internal control lo confirm thai equal amounts of c-DNA were 
used in each reaction. 



of cDNA in each lane was normalised using several house- 
keeping °enes so as to achieve a more accurate assessment of 
the expression of the A-FABP mRNA. As shown m Fig. 2. 
TCC 532- 1 exhibited the highest amount of A-FABP mRNA, 
followed by TCCs 692-1, 763-1. 616-1. 581-1. 154-1. 166-5 
and 533-1 Undetectable levels of A-FABP mRNA were 
observed in the case of TCCs 607-1 and 709-1 (F.g. 2). 
Relative mRNA levels for the ten TCCs are. given in Table I. 

A-FABP protein and mRNA levels in invasive grade III. T„ 
TCCs. Of the invasive TCCs (grade 111. T 2 . 4 ) analysed by 2D 
PACE only six yielded reasonable protein profiles for further 
study As shown in Table 1. none of these lesions expressed 
delectable levels of A-FABP as determined by Coomassie 
Brilliant Blue staining (F.g. JE. TCC 71 1-1). In line w.th these 
results, the RT-PCR analysis of these tumors also^revealed 
undetectable level of A-FABP mRNA (Fig. 2. Gr IB 1,-1,. 
Table 1). 

Loss of A-FABP protein is not compensated by an increase in 
PA-FABP. Recent studies of A-FABP knockout m.ce have 
shown that the loss of A-FABP in fat tissue is comp«»« d 
by an increase in the skin fatty acid-binding protein malMIBJ. 
Our studies, however, indicated that the human homologue 
of mall. PA-FABP (19). did not compensate for the loss ot 
A-FABP either in the non-invasive or the invasive tumors 
analysed in this study (Fig. ID and E). In addit.on. F.g. 3 
shows 2D gels or («S]-me.hionine labeled prote.ns from two 
»,ade 11. Ta TCCs (192-4. T,; Fig. 3A and 192-4. T,: F.g. 3B). 



1EF - 



PA-FABP 
I A-FABP 



Mrx 10 3 

i 

-12 



B 


PA-FABP 




* A-FABP 




> 

r 











-12 



A-FABP- iii ^ 



Figure 3. Levels of A-FABP and PA-FABP protein in grade II; Ta tumors 
resected from the same patient. The two upper panels show the 2D gel 
autoradiograms of f 5 S}- methionine labeled proteins from TCCs (grade II, 
Ta) resected from the same patient. A, TCC 192-9 tumor 4 and B.- TCC 
192-9 tumor I. Only the relevant area of the autoradiograms arc shown. The 
low panel shows the RT-PCR analysis of A-FABP mRNA expression in the 
same tumors (see also legend lo Fig. 2). 



which differ significantly in their levels of A-FABP protein 
and mRNA (Fig. 3, low panel). As shown in Fig. 3, the 
decrease in A-FABP observed in TCC 129-4, T, is not 
accompanied by an increase in the PA-FABP protein (Fig. IE). 

Discussion. 

Of the TCC progression markers identified to date, A-FABP 
is perhaps one of the most interesting as its presence correlates 
both with the grade of atypia (p=0.0006) and the stage of the 
disease (p=0.0269) (3). A-FABP is a low molecular weight 
protein belonging to a cylosolic multigene family of lipid- 
binding proteins that include heart, liver, intestinal, muscle, 
brain, skin and epithelial isoforms (20). Members of the 
FABP family are highly expressed in differentiated cells and 
show narrow tissue distribution. Their precise function is at 
present unknown, although there is evidence suggesting that 
they may play roles in intracellular lipid transport and 
metabolism, signal transduction (21,22) as well as growth 
control and differentiation (23). The role in signal transduction 
has been inferred from the fact that long-chain fatty acids and 
their metabolites can act as primary and second messengers 
in specific signalling pathways (24). Recently, it has been 
shown that A-FABP may play a central role in the pathway 
that links obesity with insulin resistance, most likely by 
connecting the fatty acid metabolism with the expression of 
TNF-o (18). Furthermore, there is evidence indicating that 
the A-FABP gene contains sequence information necessary 
lor differentiation-dependent expression in adipocytes (25). 
Our own data derived from ihc study of TCCs and normal 
urothelium suggest that A-FABP may be required for normal 
motheliurn differentiation (1), as may be the case lor PA- 
FABP in the skin ( 19): 



Considering the potential prognostic value of A FABP 
protein and/or mRNA in TCC progression it was important to 
determine if the levels of both type of ;macrornolecu(es 
correlated both in the non-invasive and the invasive lesions 
expressing and lacking A-FABP. The need for such correlation 
was underlined by recent studies of Anderson arid Seilhamer 
.(15), who reported a lack of overall correlation between the 
mRNA and protein levels of. 45 rat proteins analysed by 2D 
PAGE in combination with cDNA arrays, Their data yielded a 
correlation coefficient of 0.45 which is half way between weak 
and perfect correlation. Clearly, our data showed a very' good 
correlation between the protein and mRNA levels of A-FABP 
in all tumors analysed indicating that the loss of A-FABP 
protein observed in some tumors is not due to post- 
transcriptional regulation. 

Recently, knockout mice carrying a null mutation in the 
aP2 gene encoding for A-FABP was produced (18). These 
animals do not show an obvious morphological or metabolic 
phenotype, but exhibit a 20-fo!d increase in the levels of the 
keraiinocyte type FABP (ma/7), which may compensate for 
the loss of the deleted gene (18). The human homologue of 
the mall gene, PA-FABP, was cloned in our laboratory and 
has been shown to be highly upregulated in psoriatic skin as 
well as in abnormally differentiated primary keratinocytes 
(19). PA-FABP is expressed in normal urothelium together 
with A-FABP (3), and ongoing studies in the laboratory have 
shown that its level decreases significantly as tumors progress. 
Interestingly, the studies reported in this article did not revealed 
a compensatory up- or down-regulation of PA-FABP in the 
TCCs analysed, supporting the contention that PA-FABP may 
also play a role in cell growth and differentiation (19). 
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Expression of the pS2 gene in breast tissues assessed by pS2-mRNA 
analysis and pS2-protein radioimmunoassay. 

Hahnel E, Robbhas P. Harvey J. Sterrett G. Hahnet R . 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical 
Centre, Nedlands. 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-protein 
using a radioimmunoassay, and by determining pS2-mRNA using Northern blotting. 
There was a good correlation between the two measurements, indicating that expression 
of the pS2 gene in breast tissues may be assessed by either method. Since 
radioimmunoassay's technically easier and more efficient than Northern blotting, 
radioimmunoassay will be the method of choice in routine applications. 
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Restored expressioa aud activity of organic ion transporters rOATl, 
rOAT3 and rOCT2 after hyperuricemia in tue t at kidney. 

HabnY, Yano I , Okuda M . Fukatsu A . l..ui K 

Department of Pharmacy, Kyoto University Hospital, Faculty of Medicine, Kyoto 
University, Sakyo-ku, Kyoto 606-8507, Japan. 

We previously reported that in hyperaricemic rats, renal impairment occurred and organic 
ion transport activity decreased, accompanied with a specific decrease in the expression 
^?TT' an,0n transp0Ites - rOAT1 wA ^AT3, and organic cation transporter 
SSh i PreSCnt WC investi ^ tei the reversibility of the organic ion transport 
activity and expression of organic ion transporters (slc22a) during recovery from 
hyperuricemia. Hyperuricemia was induced by the administration of a chow containing 
unc acid and oxonic acid, an inhibitor of uric acid metabolism. Four days after 
discontinuance of the chow, the plasma uric acid concentration returned to the normal 
nu *f Cti ° nS SU ° h 38 creatilune clearance and BUN levels were restored 
although the recovery of tubulointerstitial injury was varied in sites of the kidney ' 
BasolateraJ uptake of p-aminohippurate (PAH) and tetraethylammonium (TEA) and both 
protein and mRNA levels of rOATl, rOAT3 and rOCT2 in the kidnev praduallv 
improved during 14 days of recovery from hyperuricemia. Basolateral PAH transport 
fSZS™? ° 0lT ? ati0n with the P rotein leve l of rOATl (r(2)=0.80) than rOAT3 

^mZ^^?"* taaSm Sh ° Wed 3 ?(IOng COrre,ation with <° CT2 
protein W4-0.91). The plasma testosterone concentration, which is a dominant factor in 

but Ihel?," WaS u £radUa,,y reSt ° red the ~™ nr from hyperuric™ 

bu the correlation between the plasma testosterone level and rOCT2 protein expression 
in the kidney was not significant. These results suggest that the regulation of organic ion 
transporters rOATl, rOAT3 and rOCT2, by hyperuricemia is reversible, and ZJS 
ion transport activity restores according to the expression levels of these transporters. 
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Regulation of cytochrome P4501A1 in teleosts: sustained induction of 

CVP1A1 mRNA, protein, and catalytic activity by 2,3,7,8- 

teti achlorodibenzofuran in the marine fish Stenotomus chrysops. 

Hahn ME . Stegeman JJ . 

Biology Department, Woods Hole Oceanographic Institution, Massachusetts 02543. 

Cytochrome P4501 Ai (CYP1 Al) is known to play important roles in the activation and 
detoxification of carcinogens and other toxicants in vertebrate animals, including fish. 
Although extensively studied in mammalian systems, the regulation of CYP1 A forms in 
other vertebrates is less well understood. We examined the time course and dose- 
response relationships for induction of CYP1 Al mRNA, protein, and catalytic activity by 
2,3,7,8-tetrachlorodibenzofiiran (TCDF) in the marine fish Stenotomus chrysops (scup). 
The time course of CYP1 Al induction was determined following a single ip dose (10 
nmol/kg) of 2,3,7,8-TCDF. Hepatic ethoxyresorufin O-deethylase activity was increased 
after 1 day, reached a maximum by 8 days, and was still elevated 14 days after treatment. 
The content of immunodetectable CYP1A1 protein in liver was elevated on Day 1 and 
continued to increase through 1 4 days. CYP 1 A I protein content was also strongly 
induced in heart and gill beginning at 2 days after treatment and extending through Day 
1 4. Hepatic CYP I A 1 mRNA was strongly induced by I day after dosing and remained 
elevated through 14 days. The sustained induction of CYP1 AI mRNA by 2,3,7,8-TCDF 
contrasts with the transient induction seen previously in fish treated with nonhalogenated 
inducers and most likely reflects differences in persistence of the inducers. Dose-resporise 
studies indicated that induction of CYP 1 Al mRNA, protein/and catalytic activity 
occurred following doses of 2,3,7,8-TCDF as low as 0.4 nmol/kg (120 ngflcg), within the 
range of whole-body con tents of this congener measured in fish from contaminated 
environments. The estimated dose producing half-maximal CYP1 Al induction in scup 
was approximately 2-10 nmol/kg, suggesting that the sensitivity of these fish to induction 
may be as great as or greater than that of rats. In contrast to previous results obtained with 
3 7 3 , ,4,4'-tetrachlorobiphenyl (TCB) and beta-naphthoflavone, which appear to inhibit or 
inactivate CYP1 Al in fish and other vertebrates, there was a good correlation among 
levels of CYP1 A I mRNA, protein, and catalytic activity in individual fish following 
various doses of 2,3,7,8-TCDF. The difference in response to 2,3,7,8-TCDF versus 
3,3',4 7 4'-TCB may reflect differences in the inducing potencies of the two compounds 
relative to their similar potencies as inhibitors of CYP 1 A 1 catalytic activity: In additional 
studies to evaluate structure-activity relationships for CYP1 Al induction by chlorinated 
dibenzofurans in fish, scup were treated with 2,3,6,8-tetrachlbrodiben2ofuran (2,3,6,8- 
TCDF). At 10 or 50 nmol/kg, 2,3,6,8-TCDF was inactive as an inducer of CYP1 Al' 
mRNA, protein, or catalytic activity.(ABSTRACT TRUNCATED AT 400 WORDS) 
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Brief communication 

Expression of the pS2 gene in breast tissues assessed by pS2-mRNA analysis 
and pS2-protein radiofrnmimqassay 

Erika Hahnef, Peter Robbins, Jennet Harvey, Gregory Sterrett and RoJand Hahnel . ■* ' : 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical Centre, Nedlands, 
6009, Western Australia 

Key words: breast tissue, pS2-mRNA, pS2 protein, radioimmunoassay 

Summary . : . 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-prbrein using a radioimmu- 
noassay, and by determining pS2-mRNA using Northern blotting. There was a good correlation between the 
two measurements, indicating that expression of the pS2 gene in breast tissues may be assessed by either 
method. Since radioimmunoassay is technically easier and more efficient than Northern blotting, radioim- 
munoassay will be the method of choice in routine applications. 



Introduction 

Expression of the pS2 gene is controlled by estro- 
gen. This was first described in the MCF-7 breast 
cancer cell line [1], pS2 expression has since been 
reported to be useful as a prognostic indicator f2, 
3], although ibis was not confirmed in another se- 
ries [4]. 

pS2 expression may be assessed in tissue ho- 
mogenates by analysis of pS2-mRNA [5], by ra- 
dioimmunoassay of the pS2~protein [2],. or by im~ 
munocytochemical detection of the pS2 protein in 
tissue sections f5). It was the aim of this study to 
establish the correlation between pS2-mRNA and 
pS2-protein by radioimmunoassay in a series of 
tissues obtained from mastectomy specimens per- 
formed for carcinoma of the breast. Primary breast 
carcinoma tissue, metastatic carcinoma within ax- 
illary nodes, and macroscopicaJjy benign breast 
ussue were examined. 



Materials and methods 
Breast tissues 

Tissue specimens from mastectomies performed 
for carcinoma of the breast were examined. 32 pri- 
mary breast carcinomas* 10 axillary lymph nodes 
containing metastatic breast carcinoma, and 20 
samples of uninvolved breast tissue were analyzed 
for pS2 expression. 

The primary breast carcinomas were histolog- 
ically classified using a conventional subclassifi- 
cation. The presence or absence of primary tumour 
was assessed. The presence of metastatic carcino- 
ma within lymph nodes studied was verified by 
histological examination of the node remnant after 
sampling. 

'Uninvolved* breast tissue was sampled from 
sites welJ removed from the primary breast tumour 
(usually in another quadrant of the breast), and was 
selected only if the tissue appeared macroscopical- 
ly unremarkable. Tissue sampling occurred iirime- 



Addressfor offprints: R. Hahnel. University Dcpiirimeni of Pathology. The Queen Eli2abcth I) Medico! Ccnirc. Nedlands 6009. Western Australia 



This material may be protected by Copyright law (Title 17 U.S. Code) 



72 EHahneletaL 

diately upon arrival of the mastectomy specimen in 
the laboratory, with minimal delays between re- 
moval and sampling. 

Tissues for pS2 analysis were snap frozen in liq- 
uid nitrogen and stored at ~70°C until prqcessed. 

Extraction of RNA and determination of 
pS2-mRNA , 

Details of the procedure have been described in our 
previous paper [6]. Briefly, the deep-frozen tissue 
was homogenized in a micro-dismembrator. The 
homogeneous powder, was extracted with guani- 
diniumisothiocyanatephenolchloroformiso- 
amylalcohol, and RNA was precipitated with iso- 
propanoh The washed RNA pellet was dissolved in 
SDS and glyoxylated, and the RNA preparation 
loaded onto agarose gel After electrophoresis the 
gel was capillary blotted onto Zeta-probe mem- 
branes. Membranes were hybridized overnight 
with cDNA probes pS2 and 36B4, which were la- 
beled with [aPT] dCTP by nick translations. 
Washed membranes were exposed to Kodak X- 
pmat AR film. Relative intensities of the mRNA 
bands were assessed visually as not detectable, ve- 
ry weak* weak, medium, strong, and very strong, 
taking the intensities of the ubiquitous 36B4 bands 
into account. 



' Radioimmunoassay of pS2-protein 

Deep frozen specimens were pulverized with a mi- 
crodismembrator. The tissue powder was suspend- 
ed in 10 volumes of pH7.5 phosphate buffer. The 
homogenate was centrifuged in a refrigerated cen- 
trifuge at 4°C for 60 minutes at 2600xg. The su- 
pernatant was removed with a Pasteur pipette, 
carefully avoiding the fat layer on the top. The pro- 
tein concentration in the supernatant was estimated 
by use of the Coomassie dye-binding method [7]. 
An aliquot of the supernatant was diluted to a pro- 
tein concentration between 1 and 2mg/ml before 
assay of the pS2- protein. In one case the protein 
concentration of the supernatant was well below 
J me/ml. 



The estimation of the pS2-protein was perform- 
ed using a solid phase, two-site radioimmunoas- 
say. The kits were bought from CIS Biointema- 
tional, Gif-sur-Yvette, France (ELSA-PS2). In this 
method the molecules of pS2 are sandwiched be- 
tween two monoclonal antibodies; the first one is 
coated on the ELSA solid phase, the second one is 
radiolabeled with J25-iodine. The radioactivity 
bound to the ELSA is proportional to the concen- 
tration of pS2-protein. Details of the procedure are 
supplied with the kit [8]. * 

Results and discussion 

32 primary breast carcinomas, metastatic breast 
carcinoma in 30 lymph nodes, and 20 samples of 
benign breast tissue from mastectomies were irn 
vestigated. Two of the carcinomas were of the infil- 
trating lobular type, two were ductal carcinomas in 
situ t one was a multicentric invasive ductal carci- 
noma, all others were invasive ductal carcinomas. 

Examples of pS2 Northern blots have been 
shown in our previous paper [6] which demon- 
strate that undegraded pS^mRNA can be isolated 
by the method used. 

The results of the pS2-protein and pS2-mRNA 
assays are shown on Fig. 1. There was a good corre- 
lation betweejube two types of results. When pS2- 
mRNA could not be detected by Northern blot, 
pS2-protein results were usually below Ing/mg 
protein (22 of 30) s or between 1 and 3L7ng/mg (6 of 
30). Two were exceptions (7.7 and 14.6); one of 
them could have been due to the very low protein 
content in the cy tosol which would lead to a large 
pS2 value and an associated error There was no 
explanation for the other high result. Very weak 
pS2-mRNA signals on Northern blots correspond- 
ed to pS2-protein values between LI and 19.2 with 
an average of 6.6ng/mg protein (median 5.7); The 
mean and median pS2-protein concentration in the 
tissue with weak pS2-mRNA signals were 14.3 and 
10.7ng/mg protein, respectively. The average pS2- 
protein concentration increased to 32.7 (median 
31.5) ng/ms protein for tissues assessed as medium 
pS2-mRNA intensity, and to 43.3 (median 53.8) 
ng/mg protein for tissues with strong or very strong 
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. pS2-mRNA signals. These values should be used 
as an approximate guide only, since the number of. 
samples in the various groups was fairly small. 
One-way analysis of variance confirmed that the 
means of the pS2-protein values in the groups 
made up according to their pS2-mRNA signal in- 
tensity, were significantly different (p <1(H). 

If the pS2 gene is expressed, its expression is on 
average greater in breast carcinomas than in unin- 
volved breast tissue. If one takes pS2-protein val- 
ues above 4ng/mg protein as cut-off, the average 
pS2-protein in J4 breast cancers was 343 (median 
: (35.2), while it was only 18.1 (median 13.8) in 12 
uninvolved breast tissue samples.lf the cut-off is 
taken at lOng/mg protein, average pS2-protein in 
breast carcinoma is also about twice the level of 
uninvolved tissue. There were not enough lymph 
node metastases which expressed the pS2 gene to 
allow a comparison with carcinoma or uninvolved 
breast tissue. , 

Recent preliminary results of pS2 by radioim- 
munoassay [9] are similar to ours for breast cancer 
but considerably 3 ower than our resul ts for normal 
breast tissue. 

The values of the pS2-protein measured obvi- * 
ously depend on the protein- used for calibration. 
We used the pS2-protein standards supplied with 
the CIS kit, which according to the supplier gave 
values from 0 to 740ng/mg protein in a series of 
205 breast cancer cytosols. Previously, a different 
standard had been used for presumably the same 
series of breast carcinomas [2]^ and a conversion 
factor to current standards is given as 2.8 [8]. 

It was noticed that the correlation between pS2- 
protein and pS2-mRNA was better in breast carci- 
noma specimens than in uninvolved breast tissue. 
This is unexplained, though it could be due to the - 
variable content of cell or tissue types in adjoining 
parts of a specimen, a variation more likely to oc- 
cur in our sampling of non-malignant breast tissue 
compared to sampling of carcinomas, A similar 
variability in breast carcinoma specimens will 
probably have a smaller influence on the pS2 re- 
sults, since the malignant cells - if they do express 
the pS2 gene - contain more pS2-protein than nor- 
mal breast. 
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BACKGROUND: The extent of epithelial injury in asthma is reflected by expression of 
the epidermal growth factor receptor (EGFR), which is increased in proportion to disease 
severity and is corticosteroid refractory. Although the EGFR is involved in epithelial 
growth and differentiation, it is unknown whether it also contributes to the inflammatory 
response in asthma. OBJECTIVES: Because severe asthma is characterized by 
neutrophilic inflammation, we investigated the relationship between EGFR activation and 
production of IL-8 and macrophage inhibitory protein- 1 alpha (M IP -I alpha) using in 
vitro culture models and examined the association between epithelial expression of IL-8 
and EGFR in bronchial biopsies from asthmatic subjects. METHODS: H292 or primary 
bronchial epithelial cells were exposed to EGF or H202 to achieve ligand-dependent and 
ligand-independent EGFR activation; IL-8 mRNA was measured by real-time PCR and 
IL-8 and MIP-1 alpha protein measured by enzyme-linked immunosorbent assay 
(ELISA). Epithelial IL-8 and EGFR expression in bronchial biopsies from asthmatic 
subjects was examined by immunohistochemistry and quantified by image analysis. 
RESULTS: Using H292 cells, EGF and H202 increased IL-8 gene expression and 
release and this was completely suppressed by the EGFR-selective tyrosine kinase 
inhibitor, AG 1 478, but only partially by dexamethasone. MIP-1 alpha release was not 
stimulated by EGF, whereas H202 caused a 1.8-fold increase and this was insensitive to 
AG1478. EGF also significantly stimulated IL-8 release from asthmatic or normal 
primary epithelial cell cultures established from bronchial brushings. In bronchial 
biopsies, epithelial IL-8, MIP-1 alpha, EGFR and submucosal neutrophils were all 
significantly increased in severe compared to mild disease and there was a strong 
correlation between EGFR and IL-8 expression (r = 0.70, P < 0.001). CONCLUSIONS: 
These results suggest that in severe asthma, epithelial damage has the potential to 
contribute to neutrophilic inflammation through enhanced production of IL-8 via EGFR- 
dependent mechanisms. 
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Abstract 

Lung cancer is the most common cause of cancer death 
in developed countries. The prognosis is poor, with less than 
15% of patients surviving 5 years after diagnosis. The poor 
prognosis is attributable to lack of efficient diagnostic meth- 
ods for early detection and lack of successful treatment for 
metastatic disease. Most patients (>75%) present with stage 
III or IV disease and are rarely curable with current ther- 
apies. Within the last decade, rapid advances in molecular 
biology, pathology, bronchology, and radiology have pro- 
vided a rational basis for improving outcome. These ad- 
vancements have led to a belter documentation of morpho- 
logical changes in the bronchial epithelium before development 
of clinical evident invasive carcinomas. This has changed our 
concept of lung carcinogenesis and emphasized the multistep 
carcinogenesis approach on several levels. Combined with 
the technical developments in bronchoscopic techniques, 
e.g., laser-induced fluorescence endoscope (LIFE) bronchos- 
copy, we now have improved methods to localize preinvasive 
and early-invasive bronchial lesions. With the LIFE bron- 
choscope, a new morphological entity (angiogenic squamous 
dysplasia) has been recognized, which might be an impor- 
tant biomarker and target for anliangtogenic chemopreven- - 
tive agents. To reduce the mortality of lung cancer, these 
new technologies have been taken into the clinic in different 
scientific settings. The use ol low-dose spiral computed to- 
mography ir» the screening ol a high-risk population has 
demonstrated the possibility of diagnosing small peripheral 
tumors that are not seen on conventional .X-ray. A shift in 
the therapeutic paradigm from targeting advanced clinically 
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manifest lung cancer toward asymptomatic preinvasive and 
early- invasive cancer is occurring. The present article re- 
views the recent advances in the diagnosis of preinvasive and 
early- invasive cancer to identify biomai kers for early detec- 
tion of lung cancer and for chemopreven Hon studies. 

Intro duct ion 

Lung cancer is the most common cause of cancer deaths in 
the countries of North America and other developed countries, 
accounting for 29% of all cancer deaths and more deaths than 
from prostate, breast, and colorectal cancer combined in the 
United States (I). Lung cancer will be diagnosed in —170,000 
new patients in the United States in the year 2000, and < 15% of 
them will survive 5 years after diagnosis (1). The prognosis for 
the patients with lung cancer is strongly correlated to the stage 
of the disease at the time of diagnosis. Whereas patients with 
clinical stage 1A disease have a 5-year survival of about 60%, 
the clinical stage 1I-IV disease 5-year survival rate ranges from 
40% to less than 5% (2). Over two- thirds of the patients have 
regional lymph-node involvement or distant disease at the lime 
of presentation (3). The poor prognosis is largely attributable to 
the Jack of effective early detection methods and the inability to 
cure metastatic disease. The unsatisfactory cure rates supports 
efforts aimed at enriy identification and intervention in lung 
cancer.. 

Historically, the only diagnostic tests available for the 
detection of lung cancel in its early stages were chest radiogra- 
phy and sputum cylology. The efficacy of these tests as mass 
screening tools was evaluated in controlled trials sponsored by 
the NCI 3 and conducted at Johns Hopkins University, Memorial 
Stoan-Keltcring Cancer Center, and the Mayo Clinic during the 
1970s (4-6). The principal goal of these studies was to deter- 
mine whether a reduction in lung cancer mortality could be 
achieved by adding sputum cytology testing to annual screening 
by chest radiography. Results from these trials showed that both 
tests could detect ptesymplomatic, early-stage carcinoma, par- 
ticularly of squamous cell type. Kesectability and survival rales 
were found to be generally higher in the study groups than in the 
control groups. However, improvements in readability and 
survival did not lead to a reduction in overall lung cancer 
mortality, the most critical end point. A subsequent study of 
6346 Czechoslovakia)! male smokers also found no reduction in 
lung cancer mortality ;ifter dual screening bv chest radiography 



-"The abbre\ iation;- used are; NCI. National Cancer Institute: CIS. 
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and sputum cytology (7). The negative results from these 
: screening studies lead the NCI and other health policy and 
research groups to conclude that mass screening programs in- 
volving periodic sputum cytological evaluation and chest radio- 
graphs could not be justified. However, controversies in the 
methodology and interpretation of the data from these studies 
have later been extensively discussed (8, 9). One additional 
study of annual chest X-ray screening is currently being con- 
ducted by the NCI; The Prostate-, Lung-, Colorectal-, and Ovar- 
ian (PLCO) screening trial. This trial includes individuals 55-74 
yeaTS old, but they are not selected for this trial on the basis of 
high risk for lung cancer (e.g., smoking history with >20 
pack-years). 

The failure of clinical trials to demonstrate the efficacy of 
sputum cytology and chest radiography as mass screening tools 
has resulted in a search for better diagnostic approaches for 
early lung cancer detection that take advantage of recent devel- 
opments in molecular biology, gene technology, and radiology 
(10). Furthermore, as has been the case for mammography 
screening for breast cancer, it has also been important to identify 
risk groups for lung cancer. 

Although, much is known about predisposing factors, nat- 
ural hisiory, and the outcome based on histology and stage, our 
understanding remains very incomplete in many areas. What are . 
the early premalignant changes molecularly, biochemically, and 
morphologically? Which changes are reversible and which are 
not? Whai research tools are available to provide answers to 
these questions? The identification of preinvasive lesions allows 
for developing promising methods for early intervention flJ). 
The therapeutic paradigm and focus are today shifting from 
targeting only clinically verified lung cancer as previously to* 
ward targeting the premalignant and early- malignant lesions. 
Furthermore, the prospect of Jung cancer screening has today 
become more meaningful as a consequence of recent develop- 
ments in biology and radiology and better possibilities to define 
high-risk populations most suitable for lime cancer screening 
(12). 

The present anicle will focus on the clinical perspectives of 
our biological knowledge of premalignant and early- malignant 
lesions and the potential of the recent technological advance- 
ment for early diagnosis of lung cancer. 

Pathology of Preinvasive and Early Invasive 
Bronchial Lesions 

Most of the efforts to classify lung cancer have been 
directed toward invasive carcinoma (13). However, better un- 
derstanding of the pathogenesis of lung cancer aroused renewed 
interest in morphological abnormalities that fall short of inva- 
sive carcinoma but may indicate initiation of carcinogenesis. 
These morphological abnormalities are referred to as preinva- 
sive lesions and ;ue shown in Fig. I. The last edition of (he 
WHO classification of lung tumors included the classification of 
preinvasive lesions as a separate section. Numerous recent stud- 
ies have indicated thai lung cancer is not the result of a sudden 
transforming event in the bronchial epithelium but a mullistep 
process in which gradually accruing sequential gene lie and 
cellular changes test) It in the formation of an invasive (i.e.. 
malignant) tumor. Mucosal changes in the large airways that 



may precede or accompany invasive squamous carcinoma in- 
clude hyperplasia, metaplasia, dysplasia, and'CIS (14); Hyper- 
plasia of the bronchial epithelium and squamous metaplasia 
have generally been considered reversible, and not premalignant 
in the sense of squamous dysplasia and CIS (15). 

Squamous metaplasia is a common finding, especially as a 
response to cigarette smoking. Peters et al. (16) studied bron- 
choscope biopsies from six sites in 1 06 heavy cigarette smok- 
ers; Squamous metaplasia was noted at one or more biopsy sites 
in approximately two- thirds of the group, and one- fourth 
showed squamous metaplasia in three or more biopsy sites. The 
incidence of squamous metaplasia increased with smoking his- 
tory and was highest in individuals who had smoked more than 
two packs of cigarettes a day. Auerbach et al (J 7) noted similar 
findings in autopsy tissues: basal cell hyperplasia and squamous 
metaplasia are increased in smokers in proportion to smoking 
history. Hyperplasia and metaplasia are believed to be reactive 
changes in the bronchial epithelium, as opposed to true preneo- 
plastic changes ( 1 7, J 8). The reasons for this include: (a) they 
are frequently found in association with chronic inflammation, 
and may be induced by mechanical trauma; {b) they spontane- 
ously regress after smoking cessation; (c) in chronic smokers, 
the molecular changes present in these lesions are similar to 
those present in histologically normal epithelium; and (d) there 
are no reports linking their presence to increased risk for devel- 
oping lung cancer. Jn contrast, moderate- to-severe dysplasia and 
CIS lesions seldom regress after smoking cessation (19). 

Dysplasia and CIS are changes that frequently precede 
squamous cell carcinoma of the lung. Saccomanno et al (20) 
studied more than 50,000 samples from 6,000 men, many of 
whom had worked in the uranium mining industry'- Both smok- 
ing and uranium mining (radon exposure) were found to be 
associated with increased incidence of dysplasia, CIS, and in- 
vasive cancer. The studies of Saccomanno et al. established that 
increasing degrees of sputum arypia may be recognized an 
average of '1-5 years before the development of frank lung 
carcinoma. 

Another question is: which grades of sputum arypia pro- 
gress, to cancer? From the Johns Hopkins cohort of the NCI 
chest X-ray/sputum screening trial, we know that among indi- 
viduals with moderate arypia on sputum screening, ~ 10% de- 
veloped known cancer up to 9 years later. Among individuals 
with severe atypia on the sputum screening. >A0% developed 
known cancer during the same time period (21). Although there 
are data in the literature showing the relationship between 
.sputum atypia and subsequent invasive cancer, there is still very 
little information about the histological progression in the bron- 
chia) mucosa in the high risk populations. In a recent publica- 
tion, nine patients with CIS were followed with antofluorcs- 
cence bronchoscopy at regular intervals, and 5 (56%) had 
progression to invasive cancer despite endobronchial therapy 
(22). The number of invasive cancers might even have been 
higher if treatment had not been not given. Ongoing studies of 
high-risk subjects [e.g.. the Colorado sputum cohort study) 
including serial follow-up bronchoscopies will provide evidence 
related to the frequency of development of invasive lung cancer 
as n relates to smoking history, airflow obstruction, and sputum 
atypia. 

Since the previous WHO- classification was published in 




Fig. i A, squamous metaplasia. The 
cells are widely dispersed, with a reg- 
ular maturation from (he basal region 
to the top. There is keratin 12a I ion, and 
the ouclei/cyjoplasmic ratio is low. B, 
moderate dysplasia with ASD. Hyper - 
celrularity of the epithelium with in- 
complete maturation and micropapil- 
lary invasion of capillaries are seen. 
The nuclei/cytoplasmic ratio is high. 
C, severe dysplasia. There is marked 
pleomorphism of the cells with irregu- 
larity and prominent nucleoli. 
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1981, two nonsquamous lesions have been added to (be WHO 
classification of premalignant lesions: atypical alveolar hyper- 
plasia and diffuse idiopathic neuroendocrine cell hyperplasia 
(13). Both of these lesions are diagnosed rarely. The former 
consists of lesions <5 mm in diameter and composed of a 
peripheral epithelial cell proliferation with minimal cytological 
atypia or stromal response and resembles broncbiolo alveolar 
carcinoma. The lesion has been seen in rung specimens resected 
for rung cancer, but no prospective significance of this lesion 
has been reported. However, this morphological lesion may play 
a role for the pathogenesis of peripheral lung adenocarcinomas 
(23, 24). The resolution of spiral CT (currently about 3 mm) 
approaches the diameter of these lesions, and it is anticipated 
that atypical alveolar hyperplasia will be increasingly encoun- 
tered in subjects undergoing this procedure (25). Diffuse idio- 
pathic neuroendocrine cell hyperplasia consists of a patchy 
increase in the number of well-differentiated neuroendocrine 
cells in the bronchioles. This process may result in the formation 
of small carcinoid rumors, and for this reason it is considered 
"preinvasive." To date, small cell carcinomas have not been 
associated with this lesion (13). 

Recently, the use of fluorescence bronchoscopy (see be- 
low) has increased the recognition of dysplastic lesions in the 
large airways and a new morphological entity, ASD, was iden- 
tified (26). Dysplasia of bronchial epithelium in "micropapillo- 
matosis" and the possible link between angiogenesis and prein- 
vasive bronchial epithelial dysplasia were recognized as early as 
1983 by MuIIer and Muller (27), who also described the ultra- 
structure of these lesions. Jt has been suggested that ibis angio- 
genesis, which is recognized as capillary loops projecting into 
the dysplastic bronchial lining, is responsible for the reduced 
fluorescence seen in dysplastic lesions by LIFE bronchoscopes 
(Figs. 1 and 3; Rcf. 26). Future prospective studies will show 
whether this morphological entity is conelaled with a progres- 
sion to lung cancer so as io be a target for the use of ant i angio- 
genic agents for chemoprcvention. 

In general, there are several questions/problems relating to 
premalignant lesions, which will be addressed in future studies: 

(a) The morphological criteria for premalignant and early- 
malignant changes, both on sputum cytology and in bronchial 
biopsies, have to be validated for intra- and inlerobserver repro- 
ducibility. 

(b) Uniform and repioducible morphological/cyioloyicaj 
criteria have to be published more extensively, nnd a trajninc set 
of slides should be available. By the use of Internet technology 
this could be mote easily facilitated (28). 

(c) The correlation of sputum atypia and histological 
changes in the bronchi in high-risk population is not well 
defined. 

[J) The natural course of preinvasive changes in the bron- 
chi from the high risk subjects needs to be clarified through 
longitudinal, prospective studies with reference to histological 
chances in the bronchi. Ongoing longitudinal studies with flu- 
orescence bronchoscopy ami multiple biopsies with histology 
and other biomarkeis will define the ability of these markers io 
assess for risk. 

(r) What is the paihology/biology of the small, often pe- 
ripherally located, tumors (3 mm in diameter), which arc more 



often diagnosed with newer radiological techniques {e.g., low- 
dose spiral CT)? 

(/) Optimization of the tissue procurement and processing 
. techniques are important. Distinction of reactive from neoplastic 
processes is usually straightforward, but diagnostic difficulties 
may arise in the case of (a) inadequate or poorly prepared 
histological material to evaluate and (b) the presence of cyto- 
logical atypia in epithelium stimulated by inflammation, viral 
infection, radiation, or chemotherapy. 

ig) DNA anay analyses of gene expression: will it be 
useful? How to collect proper mRNA? Can mRNA extracted 
from microdissected cells obtained at bronchoscopy be globally 
amplified and still remain representative of RNA present in situ? 

Biology of Lung Carcinogenesis aiid Potential Early 
Detection Markers. 

Lung cancer is the end-stage of multiple-step carcinogen- 
esis, in most cases driven by genetic and epi gene tic damage 
caused by chronic exposure to tobacco carcinogens. The genetic 
instability in human cancers appears to exist at two levels: at the 
chromosomal level, including large scale losses and gains; and 
at the nucleotide level including single or several base changes 
(29). Lung cancers harbor many numerical chromosomal abnor- 
malities (aneupJoidy) and structural cytogenetic abnormalities 
including deletions and nonreciprocal translocations (30). At 
least three classes of cellular genes are involved: proto-onco- 
genes, TSCs, and DNA repair genes. Oncogenic activation often 
occurs via point mutations, gene amplification, or chromosomal 
rearrangement, whereas TSGs are classically inactivated by the 
loss of one parental allele combined with a point or small 
mutation or aberrant meihylation of a target TSG in the remain- 
ing allele. Additionally, dysregulaled gene expression (either 
increased or decreased expression) can occur by other, as yet 
unknown, mechanisms (30). Present studies have not yet con- 
firmed a prominent role for abnormalities of DNA repair genes 
. in Jung cancer. 

Preneoplastic cells contain several molecular genetic ab- 
normalities identical to some of the abnormalities found in overt 
lung cancer cells (Fig. 2). These include allele loss 3t several 
loci [3p, 9p, bp, nnd f7p), myc and ras up-regulation, cyclin Ol 
overexpression, p53 mutations, and increased imrnunoreactiv- 
ity, be I- 2 ovcrexpression and DNA anenploidy (31-35). Allelo- 
typing of precisely microdissected. preneoplastic foci of cells 
suggests that the earliest changes in the bronchial epithelium is 
allele Joss at chromosome regions 3p, then 9p, 8p, |7p : 5q. and 
then / as mutations (36-39).Thc biological meaning of LOH is 
only vaguely understood. Recent evidence suggests that LOH 
may be a consequence of mitotic recombination, lhar there is 
only infrequent physical loss of genetic loci, and that LOH 
probably precedes chromosomal duplication (40). Allelic loss 
would thus be significant primarily in the presence of mutation 
in the retained allele, and gene dosage would not be expected to 
exert a phenoiypic effect in LOH. Some reports have indicated 
that ms activation occurs at early carcinoma stages (34). His- 
tologically normal bronchial epithelium adjacent to cancers has 
also been shown to have certain genetic losses. Atypical ade- 
nomatous hyperplasia, the potential piecursor lesion of adeno- 
carcinomas, often have Ki-rav mutations (4 1). 
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Fig. 2 Sequential changes during lung cancer pathogenesis. Although multiple genetic markers are abnormal in lung cancers, their appearance during 
the lengthy preneoplastic process varies. The timing of the appearance of these changes has been investigated in bronchial prcneoplasia, because 
sequential sampling of the peripheral lung is technically difficult. Several alterations have been described in histologically normal bronchial epithelium 
of smokers. Other changes have been detected in slightly abnormal epithelium (hyperplasia, metaplasia) which we do not consider to be true 
prenialienant lesions. These changes are regarded as caily changes. Molecular changes delected frequently in dysplasia are regarded as intermediate 
in timing, whereas those usually detected at the CIS or invasive stages are regarded as laic changes. It should be stressed that although there is a usual 
order, exceptions regarding the timing of onset may occur. Some changes are progressive, such as chromosome 3p deletions. Thus small discrete , 
changes are present early, progressively become more extensive during pathogenesis, and frequently involve all or almost all of the arm in CIS 
samples. Although allelic loss at the TP53 locus may precede the onset of mutations, data on this sequence are scant. Dysregulation of the RNA 
component of telomerase (with its appearance in nonbasal cells) is an early event, whereas up-regulation of the gene is a relatively late event. 



Molecular changes have been found nol only in the lungs 
of patients wjth lung cancer, but also in the lungs of current anil 
former smokers without lung C3ncer (IS. 42. 43). These obser- 
vations are consistent with the multistep model of carcinogen- 
esis and "field r ancerization" process, whereby the whole region 
is repeatedly exposed to carcinogenic damage (tobacco smoke) 
3ih1 is al risk for developing mulliple. separate, clonaljv unre- 
lated foci of neoplasia. The widespread aneuploidy that occurs 
throughout the respiratory tree of smokers supports this theory 

However, the presence of the same somatic p53 point 
mutation at widely dispersed preneoplastic lesions in a smoker 
without invasive lung cancer indicates that expansion of a single 
progenitor clone may spread throughout the respiratory tree 
<4S)_ These molecular alterations might thus be important 
targets lot use in the early detection of lung cancer and lot use 
as surrogate bioinaikers in the follow-up of chemopf even lion 



studies. Detection of these mutant cells should be possible with 
the different molecular techniques in accessible specimens. The 
prospects of diagnosing these biological abnormalities in mul- 
tiple types of clinical specimens are discussed below. 

Specimens for Clitticuf Testing: Sputum 

Since the 1930s, cytological examination of spumm has 
been used for the diagnosis of lung cancer (46). Cytological 
examination of sputa, especially multiple samples, is helpful for 
the detection of central rumors aiising from the latger bronchi 
squamous cell- and small cell carcinomas). Exfoliated 
cells from peripheral lurnois. such as adenocarcinomas, arising 
from the smaller airways (small bronchi, bronchioles, and alve- 
oli), especially those less than 2 cm in diametei. can be delected 
only occasionally in sputum samples. This has become of 
greater importance because the changes in cigarette exposition 
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(fillers and decreased nicotine content) have created an increase 
in adenocarcinomas and a decrease in squamous carcinomas 
(47-49). The sensitivity of sputum cytology for earjy lung 
cancer is only in the 20%-3O% range from screening studies, 
but by adhering to proper specimen collection, and processing 
and interpreting criteria, the yield can be substantially improved 
(50, 51). The data on the reliability of the sputum are conflicting 
(52-54). Browman et al (52) reported interobserver agreement 
of 68% for exact and 82% for within - 1-category. Holliday et 
ah (54) reported low agreement within observers (27- 60%) and 
across observers (13-50%). Within - 1 - category intraobserver 
agreement underwent a two- or 3- fold increase in agreement, 
which was also the case for interobserver agreement. The var- 
iation in intra- and interobserver agreement seems to depend on 
experience among the cytotechnicians/cy I ©pathologists and the 
composition of categories studied. A higher degree of agreement 
is obtained for higher grades of dysplasia (54). Risse et ah (55) 
showed that the ability to detect premalignant conditions is 
* dependent on the number and type of cells present in the deeper 
airways, suggesting a mode of improvement that is unrelated to 
observer reliability. MacDougall et ah (56) concluded that spu- 
tum cytology was too insensitive and insufficiently accurate to 
be included in the routine work-up of any patient suspected of 
having lung cancer. To improve the reliability of sputum cytol- 
ogy examinations a simplification of the diagnostic, categories 
from 6 (normal; squamous metaplasia; mild, moderate, and 
severe atypia; and carcinoma) to 2-3 categories have been 
proposed (54). Future clinicopathological studies will be re- 
quired to validate this concept. 

To improve the sensitivity of sputum examination as a 
population-screening tool for the detection of early lung cancer, 
several approaches are currently under development. 

jNimnnostaining. Annual sputum specimens obtained 
from individuals screened at Johns Hopkins were obtained, and 
the patients were monitored for 8 years {57). Because the 
clinical outcome of these patients was known, archival sputum 
specimens were screened for the piesence of biomarkers that 
could indicate the presence of lung tumois in an early, preinva- 
sive sl3ge. In an attempt lo distinguish the pattern of marker 
expression Tockman et oh (58) studied two monoclonal anti- 
bodies. Positive staining predicted subsequent lung cancer ap- 
proximately 2 years' before clinical recognition of the disease, 
with a sensitivity of 91% and a specificity of 88% (58). One of 
these antibodies (703 D4) had a higher sensitivity and was later 
identified as recognizing hnRNP A2/I3I (59). The ^ole of 
hnRNP A2/BI ovcrexpressicn for delecting preclinical lung 
cancer has been studied in a large high-risk population including 
6000 Chinese tin miners who were heavy smokers and who had 
an extraordinary rare of lung cancer [hi)). 1 he results from. this 
study iirdicated thai detection of hnRNP A2/B1 oveiexpression 
in sputum epithelium ceils was 2- io 3- fold more sensitive for 
detection of lung cancer than standard chest X-ray and sputum 
cytology methods. The method was particularly effective in 
identifying early disease (60). The sensitivity was J 4% versus 
21% for cytology and 42% for chest X-ray. However, the 
biomarker had a lower specificity (70%) compared with cytol- 
ogy (100%) and chest radiograph (90%). An ongoing clinical 
trial is evaluating the performance of rhc A 2/131 protein as a 
biornarker for the early detection of SPLC. The patients at risk 



for SPLC have the highest incidence of lung cancer (2-5%) 
among asymptomatic populations (6 J). In this trial, 13 SPLCs 
were identified by A2/BJ, and the sensitivity and specificity 
were 77-82% and 65-81%, respectively. Among the cases 
. identified as positive by immnjiocytochermstry and image cy- 
tometry, 67% developed SPLC within 1 year (62). Whereas the 
previous immunocytochemistry studies on material from the 
older screening materia) from the NCI-supported screening 
studies were made on sputum cells cytologically classified with 
moderately or gravely atypical metaplastic appearance, the latter 
studies nave been done on cytologically "normal appearing" 
cells. More recently Sueoka et al. (63) reported the confirmation 
of the value of overexpression of hnRNP A2/BI to detect 
preclinical lung cancer in Japan. Efforts to improve the sensi- 
> tiviry of hnRNP markers are ongoing (64). 

PCR Techniques. PCR techniques have been used for 
the evaluation of molecular biomarkers for early lung cancer 
detection. In a pilot study with selected patients from the Johns 
Hopkins Lung Project (JHLP), 8 (53%) of 15 patients with 
adenocarcinoma or large cell carcinoma were detected by mu- 
tations in sputum cells from 1 to 13 months before clinical 
diagnosis (65). However, the method seemed to be less sensitive 
than the protein marker described above, and the identification 
of specific gene abnormalities is further limited by the need to 
know the specific mutation sequence with which to probe the 
sputum specimens. Currently, this approach is not practical for 
screening undiagnosed individuals. Future advances in gene 
chip technology may permit testing for al) possible mutations of 
common oncogenes and TSGs in clinical specimens of asymp- 
tomatic individuals (62). 

Microsatellite markers are small repeating DNA sequences 
found in the noncoding regions of a gene. PCR amplification of 
these repeat sequences provides a rapid method for assessment 
of LOU and facilitates the mapping of suppressor genes (66, 
67). Microsatellite alterations are extension or deletions of these 
repeated elements. Detection of microsatellite alterations in 
histological or cytological specimens may facilitate the detec- 
tion of clonal preneoplastic or neoplastic cell populations. Al- 
though the detection of microsatellite alterations does not indi- 
cate the specific genetic change in the rumor, detection of clonal 
cell populations might serve as a cancer screening marker (65). 
Identical alterations have been found in lung cancers and cor- 
responding sputum samples demonstrating minimal atypia (68). 
The pJ6 gene is located on the short arm of chromosome 
9(9p2l) and is frequently mutated or inactivated in ftimors and 
cell lines derived from lung cancer (69. 70). Belinsky et al. (71) 
measured hypcrmetlivJaijon of the CpG islands in the sputum of 
lung rancer patients and demonstrated a high correlation with 
early stages of non-small cell hmg cancer, which indicated that 
p)6 CpG hypermethylation could be useful in the prediction of 
fuhue lung cancer. However, prospective studies are needed to 
evaluate the role of pi 6 hypennethylation as a marker for early 
lung cancer detection! Multiple other genes arc inactivated by 
hypennethylation in long cancer (72), and the detection of 
hypermethylalion may be useful for risk assessment and early 
diagnosis. 

Computer -a ssiMed lma»c Analysis. Computer- assisted 
image analysis was initially used to detect ma tig nancy- associ- 
ate* I changes ic^., subvisnal or n on obvious changes in the 
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distribution of DNA in the nuclei of histologically normal cells 
in the vicinity of preinvasive or invasive cancer; 73). In a 
retrospective analysis of sputum cytology slides, malignancy- 
associated changes alone correctly identified 74% of the sub- 
jects who later developed squamous cell carcinoma (74). The 
technique has been improved, and recent data showed sensitiv- 
ities of 75% for stage 0/1 lung cancer and 85% for adenocarci- 
nomas with a specificity of 90% (75). This quantitative micros- 
copy technique allows the examining of thousands of cells per 
slide within a relative short time. Similar techniques have been 
approved in the United States for cervical cancer screening, and 
might, in the future, play a role for lung cancer screening. 
However, no prospective clinical studies has evaluated this 
technology in a larger lung cancer screening setting; 

High Throughput Technology. With future advances in 
^ene chip technology, it might become feasible to probe for 
expression of multiple genes in sputum specimens of asymp- 
tomatic individuals. However, this requires a iarge amount of 
undegraded RNA from respiratory tract cells. With the high 
throughput technology, a higher sensitivity might be achieved 
by using multiple markers at the cost of achieving a lower 
specificity, which would be undesirable for a screening study. 

In conclusion, we need to reevaluate the role of sputum 
cytology for screening and early detection of lung cancer be- 
cause of advances in biomarkers and technology. Ongoing stud- 
ies with standard and biomarker analysis in high-risk groups 
might change the previous negative altitude and provide a new 
perspective on sputum cytology as a mass screening tool when 
applied in a high-risk population. Adding different molecular 
diagnostic tests gives the possibility for early diagnosis far in 
advance of clinical presentation. However, validation of the 
tests in larger prospective studies is necessary, and the individ- 
ual tests have to be compared with each other to define the role 
of early diagnosis in the overall management of high-risk sub- 
jects. Furthermore, health economic issues have to be consid- 
ered. 

Specimens for Clinical Testing: BAL 

BAL involves the infusion and reaspiration of a sterile 
. saline solution in distal segments of the lung via a fiberoptic 
bronchoscope. Ahiendt et ai. (76) examined a series of 50 
resected non-SCLC tumor patients and compared the rumor and 
BAL with regaid to molecular markers including p53 mutations, 
K-ras mutation, the mcthyfaiion status of the CpG island of the 
■ pI6 gene, and microsatellite alteration (Tables I and 2). With 
the possible exception of the test for microsatellite alteration, all 
of the test? had relatively high sensitivity and could detect 
mutant cells in the presence of a large excess of normal ce lls. 
The "frequencies of these changes in the rumors ranecd from 
27% (for K-ras mutations) to 5/>% {for p53 mutations). As 
expected. p53 mutations were moic frequent rn centra I (piedom- 
i nam ly- squamous cell) tumors, and K-ras mutations were more 
frequent in peripheral (predominantly adenocarcinoma) tumors. 
The specificity was high (nearly 100%) because, with the ex 
ccption of mic rosarellrle alterations., the same genetic chance in 
13 At. sample as in rumors was always found, but the sensitivirv 
was low. and in only 53% of tumors that contained molecular 
lesions were the same abnormalities detected in 'corresponding 
BAL fluids. Specifically, the tests were least helpful in the 



group of patients in whom improved diagnostic abilities are 
most needed, those with small, peripherally located tumors (77). 
Unfortunately, the investigators were not able to compare the 
molecular tests with routine cytopathologicaJ analysis of the 
BAL specimens. The sensitivity of the molecular tests in BAL 
specimens has to be improved, and we need to know the results 
from subjects at increased risk (current and former smokers 
without lung cancer and survivors of previous cancer of the 
upper respiratory tract) and subjects with chronic lung diseases 
as well as results from healthy never smokers. 

A European group has previously shown that genetic al- 
terations detected in DNA from bronchial lavage of individuals 
with lung cancer were also found in individuals with no evi- 
dence of malignant disease (78), which raises the question about 
the specificity of such molecular damage in neoplastic condi- 
tions. To improve the sensitivity and specificity of detecting 
allelic imbalance in lung tumors, high throughput PCR : based 
microsatellite assays have been established (79). In a recent 
study by Fielding et al (80), Ihe up-regulalion of hnRNP A2/B 1 
was found to be a promising marker in BAL for Ihe detection of 
premalignant and malignant bronchial lesions with a diagnostic 
sensitivity of 96% and a specificity of 82%. 

It is too early yet to make conclusions as to whether BAL 
examinations will add to other pathological/molecular biologi- 
cal clinical studies. To obtain diagnostic material for BAL 
bronchoscopy is required, and we do not have any data that 
compare BAL examinations with biopsies. Thus, we do not 
know whether BAL is a valuable adjunct to the biopsies taken 
under the same bronchoscopy procedure. 

Specimens for Clinical Testing: Peripheral Blood 

For many years scientists have searched for a lung cancer- 
specific tumor marker that could be detected in peripheral blood. 
Optimism was raised in the "early" immunocytochemistry era 
by the use of monoclonal antibodies raised against more-or-less 
specific epithelial epitopes. In the search for epithelial cells in 
peripheral blood and bone marrow, monoclonal antibodies 
against cytokeratin have been used. However, these reactions 
Lire clearly not cancer- specific, and some antibodies have been 
shown to. cross- react with normal blood or bone marrow ele- 
ments (81, 82). Another explanation could be that cells from the 
macrophage/monocyte system may contain proteins derived 
from the primary tumor that have undergone necrosis and 
apoptosis and that these processed proteins are recognized by 
the antibodies (S2). On the basis of "traditional" immunocyto- 
chemistry, no markers have been able to detect premalignant or 
cnrly- malignant disorders based on a peripheral blood sample. 
However, wirh the development of DNA technologies, new 
possibilities have been raised, and. with the use of PGR tech- 
niques, some promising reports have been published. 

Nanogram quantities of DNA circulating in blood arc pres- 
ent in healthy individuals (S3, 84). Tumor DNA is also released 
into the plasma component in increased quantities (85, 86). 
Thus, the plasma and serum of cancer patients is enriched in 
DNA, an average four times the amount of free DNA as com- 
pared with normal controls (87). In a study by Chen et a}. (88). 
a comparison of microsatellile alterations in rumor and plasma 
DNA was done in SOX patients, and 93% of the patients wjih 
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Table J Tissues and other resources for the study of molecular markers 



Specimen 



Ref. 



Comments 



Tumor tissue Numerous 

i 

Sputum 65, 68, 71, 74 

Surrogate organ- 140 



141, 142, 143 



42, 43, 45, 144, 145 



76, 78, 146, 147, 148 



72, 92, 149 



Bronchial brush/wash 
Bronchia) (issues „ 



BAL fluids 



Blood componenis 



Tissue for molecular staging 



Tumor celt lines 152, 153 



Cultures of nonrnalignant tissues 



Nonmalignani tissue from patients 156, 157. 153 
and from cancer- free jetoiives 



Mixture of cell types, may require microdissection (139). Extensively used 
for most. studies. Alcohol-fixed fine-needle aspirates may be used for 
mutational arid other studies. 

Respiratory cells usually in small minority. Most samples fixed in 
Saccomanno's fixative. Several studies have been performed on these 
specimens many years later. 

Predominantly squamous epithelial cells. Buccal smears; brushings of 
tongue or tonsil may be explored as potential surrogate organs resulting 
from the field effect of tobacco damage of the entire upper., 
aerodigestive tract. This concept needs to be confirmed. 

Predominantly respiratory cells. Fresh, frozen, or alcohol-fixed samples are 
suitable for multiple studies including FJSH." 

Usually from bronchial biopsies, but may be obtained from surgical 
resection specimens. Formalin fixation and paraffin embedding required 
for histological diagnosis, although EAS1 preps may permit 
identification and isolation of subpopulations. Paraffin sections may be 
used for genotyping polymorphisms, for allelotyping, and for in situ 
hybridization. 

BAL fluids are useful for examining the peripheral airway cells, which are 
the precursor cells of most adenocarcinomas. Numerous mononuclear 
cells present. Enrichment of epithelial cells desirable. 

Analysis of circulating tumor cells and genetic material shed by dying 
tumor cells into the plasma component may yield useful biological and 
diagnostic information. Gene mutations and presence of epithelial cell 
markers have been used to detect circulating rumor cells. Gene 
mutations, allelic loss, microsatellite alterations, and aberrant 
methylation have been used to identify tumor cell DNA released into 
the fluid compartment. 

Although little data exists for lung cancers, regional lymph nodes, sentinel 
lymph nodes, and surgical resection margins have been used in other 
tumor types for molecular staging. 

Provide an unlimited self-replicating source of high-quality molecular 
reagents and have been used for numerous studies. Cell lines may or 
may not reflect the properties of the tumors from which they were 
derived (26). although they probably represent cellular subpopulations 
(27). Aggressive metastatic tumors are more likely to be successfully 
cultured (28) lesulting in skewed d^la. 

Epithelial cultures may be useful for studying molecular changes during 
multistage pathogenesis. Temporary as well as a few immortalized 
cultures from non malign ant epithelial cells have been established. 
D-lymphoblastoid cultures are useful for linkage analysis, for genetic 
suspect ibility studies, and for allelotyping corresponding rumors. 

T issues such as buccal smears, tumor- free lymph nodes, and peripheral 
blood mononuclear cells are useful as controls foi linkage analysis, for 
generic susceptibility studies, and for allelotyping corresponding tumors. 



" FISH, fluorescence in situ hybridization; EASf. epithelial aggregate separation and isolation. 



microsatellite alterations in tumor DNA also had modifications 
in the plasma DNA. However, some patients had LOH only in 
the rumor DNA. Because most of the microsatellite alterations 
were similar in tumor DNA and plasma DNA. they concluded 
that some of the DNA circulating in the blood comes from the 
tumor. Thus, modifications of circulating DNA can be used as 
an early detection marker. Detection of aberrant DNA methyl- 
ation in serum DNA in patients with non-SCJLC has been 
reported (72). Although the number of patients was small and 
the hypermr thylated DNA was found in all stages, it opens up 
for the possibility to be used as an early lung cancel detection 
marker. Furthermore, p}3 and ras gene mutations have been 



detected in the plasma and serum of patients with colorectal 
cancers (89-? J), pancreatic carcinomas (92. 93). and hemato- 
logical malignancies (94). 

In conclusion, the limited direct accessibility of lung car- 
cinomas has led to efforts to identify tumor-associated soluble 
markers in serum or plasma. Many of the currently recognized 
soluble markers were fust identified as "tumor"" markers but. 
when evaluated in nonneoplastic tissue, have often been found 
in normal cells as well as in tumors. Foi early detection of lung 
cancer, we need more clinical data evaluating these new molec- 
ular biological markets from multiple sites, especially in high* 
risk groups. 
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Table 2 Molecular approaches for lung cancer investigation 



Specimen 



Red 



Gene mutations 



Allelotyping 



Gene expression a I RNA 
and protein level 



Molecular cytogenetics 



Comparaiive genomic 
hybridization 

Morphometric studies 



159, 160, 161 



.18, 158 



H5 f 162, 163 ; 164, 165, 166 



40, 167. I6S. 169. 170 



171, 172 



Comment 



Widely used technique, especially for p53 and ras genes. Often used 
to determine the r61e of a newly discovered gene in the 
pathogenesis of lung cancer May be of diagnostic and prognostic 
significance. Multiple methodologies available. ... 

Useful as a partial substitute for mutational analysis and for 
determining the chromosomal locations of putative tumor 
suppressor genes. Widely used to study multistage pathogenesis. 
Readily performed on formalin- fixed and microdissected tissues. 
Increasing use of genotyping using automatic sequencers. 

Northern blotting and reverse transcript ion-PCR are widely used to 
investigate gene expression. Western blotting often used for , 
detection of protein expression. In siru hybridization for message 
expression can be performed on paraffin-embedded tissues and, 
thus, can be used to investigate multistage pathogenesis. 
Microarray techniques offer promise of examining all or most of 
the genome but currently require relatively large amount of high- 
quality RNA from purified cell populations Sage technique useful 
for investigation and identification 'of expressed genes. Similarly, 
advances in proteomics will permit simultaneous detection of 
multiple proteins. Numerous immunohistochemica) studies of 
. oncogene expression have been used to study multistage 
pathogenesis. Of particular interest, hnRNP expression on 
exfoliated epithelial cells in sputum samples may predict for 
development of cancer. 

In situ hybridization studies of fixed materials or using smears has 
provided considerable information about numerical and structural 

changes. 

Useful for detection of gene amplifications. Less sensitive for the 
detection of regions of allelic loss. 

May be applied to paraffin-embedded tissues. Useful for determining 
aneuploidy and for measuring a number of nuclear and 
cytoplasmic parameters. ' . 



. Specimens for Clinical Testing: Bronchoscopy 

WJLB is the most commonly used diagnostic loo I for ob- 
taining a definite histological diagnosis of lung cancer. Bron- 
choscopy has major diagnostic limitations for premalignant le- 
sions. Because these lesions are only a few cells thick (0.2-1 
mm) and have a surface diameter of only a few millimeters, ihey 
rarely are observed as visual abnormalities. WooJner (95) re- 
ported that squamous cell CIS was visible to experienced bron- 
choscopists in only 29% of cases. To address this limitation, 
fluorescence bronchoscopy was developed. Early studies of 
fluorescence bronchoscopy entailed the use of fluorescent drugs 
(hematoporphyrin dyes) that were preferentially retained in ma- 
lignant tissue (96). Although., studies evaluating this nppioach 
did. in fact, show thai eailv invasive and in siiu cancels could be 
localized., the detection of dysplasia irmnined problematic (97- 
100). Furthermore, the development of photodynamic diagnos- 
tic systems was hampeied by problems including skin photo- 
sensitizing and interference with tissue autofluorescence. To 
overcome these problems, a new laser photodynamic diagnostic 
system was developed (101). This system detected tumor- 
specific drug fluorescence at 630 nm wavelength, which is far 
from normal tissue autofluorescence (500-580 nm). and inter- 
ference bv autofiuorcsccnce horn normal tissue should then 
have been eliminated; but it remained a significant problem 
(102). 



Another approach was developed by Palcic et a I. (103), 
who noticed the lack of autofluorescence in the tumor lesions by 
using blue light (442 nm) rather than white light to illuminate 
the bronchial surface. They amplified the difference in autofluo- 
rescence between normal, premalignant, and tumor tissue for 
clinical use (103, 104). Using a high- quality- charge coupled 
device and special algorithm, the L1FJ6 was developed, taking 
advantage of the principle that dysplastic and malignant tissues 
reduce autofluorcscent signals compared with normal tissue 
(Fig- 3). 

Several studies have been performed comparing the diag- 
nostic specificity and sensitivity of LIFE bronchoscopy versus 
WLD in diagnosing preinvasive and early- invasive lesions 
(105-109; Table 3). Most of the studies reported a higher 
diagnostic sensitivity of LIFE bronchoscopy in the detection 
premalignant and eatly-malignani lesions at the cost of lower 
specificity (i.e., more false-positive results). In most of these 
studies, lesions with moderate dysplasia or worse were the target 
of the study nnd rated as "positive.'' The prevalence of prein- 
vasive and early lung cancer varies widely from one study to 
another, from 20.2% (105) to 65.8% (102). The explanation 
might be beyond the risk profile of genetic variations oi differ- 
ent levels of experience among the endoscopists as well as the 
pathologists involved. Furthermore, theie seems to be a naming 
effect in using the LIFE bronchoscope, which has been demon- 
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Fig. 3 A, normal WLB ami normal LIFE bronchoscopy. /?,,WLB shows -inflammatory changes in ibe bronchial mucosa but no suspicion of 
■malignancy (lefi). LIFE bronchoscopy shows Jfffusc reduced auiofluoiescencc (visualized by diffuse reJ-brownish c»lorizQUon\ arrows'). Dropsy 
demonstrated diffuse severe dysplasia. 



strated by Venmans et ai ( J 07). In (heir study, the diagnostic 
sensitivity increased from 67 to 80% when comparing the first 
and the second half of the study. The use of the LIFE device in 
conjunction with WLB improved ihe detection i3ie of preneo- 
plastic lesions and CIS significantly t Table 3). Kuiie er ul. (106) 
looked for more subtle tissue transformation, but theii study 
included few patients with moderate dysplasia or .worse. No 
improvement in the evaluation of metaplasia imles was ob- 
served by the use of LIFE bronchoscopy. Thus, differences in 
the study population might explain the differ cut conclusion. 
There are still no clinical studies with sufficient long-term data 
showing that moderate dysplasia is the most relevant clinical 
piedicior of eventual malignancy. Limitations in making con- 
clusions from the existing studies rue also the potential meth- 
odological bias related to the order in which ihe different bion- 
choscopy procedures are dene and whether the same examiner 
has performed botli procedures. To address these issues, a 



prospective randomized study between LIFE bronchoscopy and 
WLD was done at the University of Colorado Cancer Center. 
The study design included a randomization wjih regard lo the 
order of procedure as well as the otder of the individual bron- 
choscopisl (109). Ihe order of ihe procedure and of the indi- 
vidual bionehoscopisi did not affect the results. The study also 
demonstrated a significantly higher sensitivity in delecting ple- 
ural ignant lesions visualized by the LIFE., but at the cost of a 
lower specificity (J0°). The reason for the low diagnostic spec- 
ificity found with the LIFE bionchoscopy in the different studies 
might be attributable lo the visualization of more abnormal foci 
wjth the LIFE bronchoscope, with the consequence that a larger 
number of biopsies were taken and. thus, there w as a higher risk 
of more false- positive results. The use of LIFE bionchoscopy 
has led to the identification of a new morphological entity, the 
ASD. which is described above. In a recent morphological study 
angic*dyspbsiie changes were frequently found in preneoplastic 
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Fig. 4 Seventy-one-year-old man with a spicular nodule in upper lefl 
lobe demonstrated on low-dose helical CT (picture), but not visible on 
chest X -radiography. CT- guided biopsy showed adenocarcinoma. 



and early-malignant lesions in the bronchi (26). The morpho- 
logical entity has been confirmed in preneoplasias among smok- 
ers, and the perspectives of this finding have been extensively 
discussed (110). The prognostic significance of this morpholog- 
ical entity is currently studied in ongoing long-term follow-up 
studies. Future studies have lo evaluate the role of ASD as a 
biomarker for early lesions and whether it can be used as a 
marker for treatment effect or therapeutic target for chernopre- 
vention. 

The LIFE bronchoscope may play an important role in the 
screening and follow-up of subjects at high risk of developing 
lung cancer. At this stage, however, it is unknown whether the 
LIFE bronchoscope will lead to a reduction in lung cancer 
mortality. There are also no data on cost-effectiveness and 
.cost-benefit analyses available for this new diagnostic proce- 
dure. The use of the LIFE bronchoscope may also in the future 
be extended to other indications, e g. f patients staged as having 
resectable lung cancer.on one side. Whether LIFE bronchoscopy 
of the contralateral lung will disclose abnormalities, which 
would change the therapeutic decision, is not yd reported. . 

'Decent Advances in Kailiology 

The previous NCI-sponsored screening trials failed to dem- 
onstrate any reduction' in the lung cancer mortality by sputum 
cytology and yearly chest radiography as mass screening tools 
for lung cancer screening. Limitations of design and execution 
of the studies, however, have been discussed extensively (8. 
III. 112). An extended follow-up (median, 20.5 years) of the 
Mayo Lung Project was recently published (113). There was 
still no difference in lung cancer mortality between the inter- 
vention arm and the conltol arm (4.4 versus 3.9 deaths per 1000 
person- years). However, the median survival for patients with 
resected early-stage disease wos 16.0 years in the intervention 
arm versus 5.0 years in the usual- care nnn [P < 0.05). The lattei 
findings have raised the question as to whether some small 
lesions with limited clinical relevance may have been identified 
in the intervention arm, and the question of "overdiagnosis" was 
discussed in accompanying editorials ( I I't'l. 



Mass screening for lung cancer has been performed in 
Japan for many years and has been performed in over 500,000 
. people in about 80% of the local communities (JJ5). Sobue et 
at (H6) observed that annual clinic-based chest X-ray screen- 
ing for lung cancer in Japan showed reduced hmg cancer mor- 
tality by about one-fourth among individuals who. underwent 
screening once a year. In this screening program, the relative 
odds ratio of dying from lung cancer within 12 months was 
0.535 and in the 12-24-month period was 0.638 (1 17). How- 
ever, many studies have focused on the pitfalls in the detection 
of abnormalities by radiography (118-122). The limit of chest 
radiographic sensitivity for nodule detection is roughly I cm in 
diameter, by which time the rumor has over 10 9 cells and may 
already have violated bronchial epithelium and vascular epithe- 
lium. CT has been shown. to be more effective in the detection 
of peripheral hmg lesions compared with plain radiography or 
conventional tomography of the whole lung (123, 124). 

Spiral CT scan is a relatively new technology with the 
ability to continuously acquire data resulting in a shorter scan- 
ning time, a lower radiation exposure, and improved diagnostic 
accuracy compared with those of plain radiography (125-127). 
Spiral CT allows the whole chest to be imaged in one or two 
breath-holds, reducing motion artifacts and eliminating respira- 
tory misregistration or missing nodules. Although there is 
greater radiation exposure with CT than with chest radiography, 
, low- dose techniques (lower mA of 30-50 compared with 200 
for conventional CT) have achieved calculated exposure doses 
that are J 7% that of conventional CT and 10 times that of chest 
radiographs. Further reduction in radiation dose while maintain- 
ing diagnostic accuracy is a topic of current research. Further- 
more, for the baseline screening, low- dose spiral-CT-scan i.v. 
contrast is not administered. Nodules as small as 1-5 mm can be 
shown with modern spiral CT technology (25, 128). The obvi- 
ous advantages with this new technology led some groups in 
Japan and in the United States to look to low-dose spiral CT as 
a tool for screening (Refs. 129-131; Tables 4 and 5). 

In a Japanese report, spiral CT scans and chest radiographs 
were done twice a year in 1369 individuals (J 29). Peripheral 
' lung cancer was detected in 15 (0.3%) of 3457 examinations, 
and, among the 15 lung cancer cases detected, the results of 
chest X-ray were negative in 11(73%), and the tumors were 
detected only by low-dose spiral CT. The detection rates of 
low-dose spiral CT and chest X-ray were 0.43% (15 of 3457 
examinations) and 0.12% (4 of 3457 examinations), respec- 
tively. Furthermore. 14 (93%) of the J 5 Jung cancers were stage 
1 disease. The histology showed that II of the 15 lung cancer 
cases were adenocarcinoma, and 4 had squamous cell carci- 
noma. The effective exposure dose with spiral Cf scan in that 
study was calculated to about one-sixth that of conventional CT. 

The ELCAP in New York was designed lo determine: (n) 
the frequency with which nodules were detected; (/>) the fie- 
quency with which detected nodules repicsent malignant dis- 
ease: and (c) the frequency with which malignant nodules are 
curable (131). In the LLCAP study. 27 lung cancers were found 
among 1000 subjects screened. Among the 27 patients with 
cancer. 85% had stage I disease (Table 5). 

Another population- based study on low -dose CT screening 
has been published by Sone et al (130). using a mobile low- 
dose spiral CT scanner. The detection rale was 0.48% {i.e., 4-> 
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Author 



Table 3 Bronchoscopy versus WLB in diagnosing premaligriant and early-malignant lesions 

Sensitivity Specificity Predictive values 



Relative Relative PPV NPV 

No of L1FE+ , sensitivity L1FE+ specificity L1FE+ L1FE+ PPV NPV PPV NPV 

biopsies WLB LIFE WLB L1FE+WLB WLB LIFE WLB LIFE+WLB WLB WLB LIFE LIFE WLB WLB 



Lam ei al. (105) 
Kurie et al b (106) 
Venmans et al (107) 
Vermuten et al. (108) 
Kennedy et al (109) 



700 
234 
139 
172 
394 



0.67 
NR 
NR 
0.93 
0.79 



NR 0.25 

0.38 NR 

0.89 078 

NR 0.25 

0.72 0.18 



6.3(2.7)* 

NR 
L43 
3.75 
4.4 



0.66 
NR 
NR 
0.21 
03 



NR 0.90 

0.56 NR 

0.61 0.88 

NR 0.87 

0.43 0.78 



NR 
NR 
NR 
NR 

0.38 



0.33 
NR 
0.20 
0.13 
0.21 



0.89 
NR 
NR 
0.96 



NR 
0J6 
0.14 
NR 



0.85 0.25 



NR 0.39 0.83 

0.81 NR NR 

0.99 0.32 0.98 

NR 0.19 0.90 

0.87 0.17 0.80 



• PPV, positive predictive value; NPV, negative predictive value; NR, not reported. 

* Based on reference pathologist. 

r If invasive carcinoma is included. 





No. of individuals 
srudied 


True 
positive n 


False 
positive" % 


Predictive 
value % 


Detection rate % 




Age incl. 
V 


Authors 


LDCT 


. X-ray 


Pack-yr 


Kanelcoef al (129) 
Sonc etal (130) 
Henscbke */ at. (131) 


1369 - 

3967 

1000 


15 
19 
27 


15.6 
5.0 
20.1 


6.6 
8.8 
11.6 


0.43 
0.46-0.5 
.2.7 


0.12 
0.70 


>20 
>30* 
>10 r 


>50 
40-74 
>60 



" Defined as individuals with "test-positive," in whom further workup gave no suspicion of malignancy. 
h The study also included a group of nonsmokers. 
* Average - 45 (not reported in the other studies). 



Table 5 Histology, stage, and size of primary lung cancer delected by low-dose spiral CT 





No. of cancers/ 
No. screened 


Histology % 




TNM % 






Size (mm) 




Author 


Adeno" Squam. Other 


1 


11 111 


IV 


Average 


Range <10 11-20 


>2I 


Kaneko etal (129) 
Sone et al (130) 
Henschke etui (131) 


15/1369 (1.1%) 
19/5483 (0.3%) 
27/1000(2.7%) 


73 ' 17 
' 63 5 32 
67 3 30 


93 
84 
85 


7 

4 II 


16 


12 
' 17 


8-18 

6-47 4 14 
15 8 


■3 
4 



" Adcno, adenocarcinoma; Squanv. squamous cell carcinoma; TNM, rumor-node-meiastasis. 



cases per 1000 examinations). Surprisingly, there was no dif- 
ference in ihe detection rale among smokers (0.52%) versus 
nonsmokcrs (0.46%). The results from the thiee population- 
based studies are summarized in Tables 4 and 5. The conclusion 
from these studies is that 85% of the lung cancel s detected by 
low-dose CT were in stage ) f of feting improved possibility for 
curative treatment and better prognosis in general. However, the 
issue of "false-positive" scans has to be taken into consideration. 
Thus far. up to 20% of the participants with nodules on the scan 
had no malignancy during the follow-up period. The possibility 
thai the cancers found represent incidental cancers as in the 
Mayo Lung Project must also be considered ( 1 14). The results 
from these studies confirm the expectation that low-dose CT 
increases the detection of small nonealcified nodules and. that 
lung cancer at an railjer and moie curable stage are detected. 
The mobile CT scieening study by Sone et r.l (130) showed thai 
low-dose CT increased the likelihood of detection of malignant 
disease 10 times as compared with radiogiaphy. The overall rate 
of malignant disease was lower in ihe Japanese studies (129. 
130) compared with the ELCAP study i Kef. 131: detection tales 
0.43-0.48% versus 2.7%). This could be because the Japanese 
studies screened individuals from the genera! population ages 



40-74, whereas ELCAP screened people at high risk, ages ^6(X 
with a tobacco history of at least 10 pack-years. Thus, as 
expected, the risk of the population to be screened affects the 
rate of cancer detection. 

Questions remaining to be answered, include: (a) what aie 
the diagnostic sensitivity and specificity of this piocedure: and 
(6) does screening reduce lung cancer mortality? The spiral CT 
has not been as sensitive for small central cancers as it is for 
small peripheral cancers (129, 131). Minute nodules of lung 
cancer that are near the threshold of delegability may be over- 
looked at spiral CT scieening (132). A prospective study of the 
diagnostic sensitivity of spiral CT has recently shown that the 
diagnostic sensitivity exceeded the sensitivity of conventional 
CT in previous reports (25). However, there were limitations in 
the detection of intrapulmonary nodules smaller than 6 mm and 
of pleural lesions. Compared with surgery (thoracotomy with 
palpation of deflated lung, resection, and histology), the sensi- 
tivity of spiral CT was 60% for intiapuhnonary nodules of <6 
mm and 95% lor nodules of >6 mm and was 100% for neo- 
plastic lesions 5:6 mm. furthcimore. a matked difference in the 
sensitivities of two independent observers was found foi nod- 
ules smaller than 6 mm, whereas agreement was much better for 



6^10-mm nodules (25). Given these promising preliminary clin- 
ical results, further research is needed to determine the optimal 
technique for spiral CT screening, which includes collimation, 
reconstruction interval, pitch, and viewing methods. Decreasing 
the slice thickness to 3 mm, monitoring the viewing of exami- 
nations, and computer-aided diagnosis have been used to im- 
prove the diagnostic capability of spiral CT in the detection of 
pulmonary nodules (135-136). 

Future large scale randomized studies have to confirm 
whether in. fact spiral CT screening will lead to a reduction in 
rung cancer mortality. In a randomized study, the following 
questions arise: (a) what is the optimal high-risk group to study 
and what should be the control arm? (b) what should be the end 
points (goals) of the studies? The ultimate goal is to reduce the 
lung cancer mortality. However, although this is a long-term 
goal, intermediate end points from such studies should be eval- 
uated. The change to more curable stages at diagnosis for the 
Jung cancer patients is one such immediate goal; (c) what is the 
optimal workup and the morbidity of this program? (d) what is 
the cost of such a screening program? and (e) what is the 
false-positive rate of the screening findings? Incorporation of 
smoking cessation programs should be included in the future 
design of screening studies because it has been shown that 
screening with low-dose CT in participants who are stiJ) smok- 
ing provides substantial motivation for smoking cessation (137). 

The studies with spiral CT-scan have demonstrated the 
superior diagnostic ability in the detection of small peripherally 
located tumors, most of the malignant ones of adenocarcinoma 
type of histology, the diagnostic sensitivity of spiral CT for 
more centrally located tumors (mostly squamous cell carci- 
noma) is significantly tower than for the peripherally located 
ones. Through these spiral CT studies, we will learn about the 
biology, pathology, and clinical course of these small tumors, 
which might be different from what we know about clinically 
more evident rumors defected routinely in previous smdies. 

Because lung cancer is so common, the introduction of any 
new screening technique in this area has to be underpinned by 
careful definition of the cost implications and must be justified 
by compelling evidence. The cost- effectiveness of the spiral 
CT approach should be assessed by evaluating the rate of 
over-diagnosing nonmalignant, relatively common abnormali- 
ties and comparing CT imagine to other diagnostic technologies. 

PET with FDG has recently emerged as a practical and 
useful imaging modality in the preoperative staging of patients 
with lung cancer. However, whereas CT is most frequently used 
to provide additional anatomical and morphological information 
about lesions, the FDG PUT imaging provides physiological and 
metabolic information that characterizes lesions mat are inde- 
terminate by CT. 1 : DG PUT imaging takes advantage of the 
increased accumulation of FDG in transformed cells and is 
sensitive (-95%) for the detection of cancer in patients who 
have indeterminate lesions on CT (138). The specificity (~ 8.*>%) 
of PET imaging is slightly less than its sensitivity because some 
.inflammatory processes avidly accumulate FDG. The high neg- 
ative predictive value of PHT suggests that lesions considered 
negative on the snidy are benign, biopsy is .not needed, and 
radiographic follow-up is recommended. Several studies have 
documented the increased ;;ccuracy of PET compared with CT 
in (he evaluation ol rbe hilar and mediastinal lymph node status 
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in patients with lung cancer (138). However, the PET resolution 
is sufficient only for nodules S;6 cm and will not be helpful in 
detecting the very small nodules. Compared with low-dose 
spiral CT, the FDG PET scan is more expensive and time 
consuming. The role of PET scan in early diagnosis of lung 
cancer in an asymptomatic high-risk population is not yet eval- 
uated. However, future studies have to include PET evaluation 
to define its role in a population screening setting. 

Conclusion 

Recent advances in molecular biology and pathology have 
led to a better understanding and documentation of morpholog- 
ical changes in the bronchial epithelium before development of 
clinical evident lung carcinomas. Combined with technical de- 
velopments in radiological and bronchoscope techniques, these 
procedures offer great promise in diagnosing lung cancer, far in 
advance of clinical presentation. Any of these individual proce- 
dures could be incorporated into the routine management of 
individuals at risk for developing primary or secondary lung 
cancer, and for several of these methods, clinical studies are 
under way. Preliminary reported data are very promising for the 
early detection of lung cancer. Future studies must incorporate 
the different methods in a multidisciplinary scientific setting to 
evaluate the role of the individual method in the overall man- 
agement for individuals at high risk for developing Jung cancer. 
Several of these tests might diagnose the disease at the stage of 
clonal expansion before invasive carcinoma has developed. A 
management and intervention strategy appropriate to that stage 
of disease have to be developed. Preliminary studies of cherno- 
prcvention agents are reported, and new agents based on other 
biological mechanisms are under development and ready for 
clinical trials. It is now time to plan clinical trials that evaluate 
both diagnostic and therapeutic approaches to access their im- 
pact on the incidence of clinical lung cancer. 
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Localization of tissue inhibitor of metalloproteinascs 1 (TIMP-1) iu human 
colorectal adenoma and adenocarcinoma. 
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Tissue inhibitor of matrix metalloproteases I (TIMP-1) inhibits the proteolytic activity of 
matnx metalloproteases and hereby prevents cancer invasion. However T4-T^ f 
possesses other functions such as inhibition of apoptosis, induction o ma l n 

rj ° d fr ° m ^'orecta! cancer patients and that hid! TIMP-1 levek 

pred,ct poor prognose To clarify the role of TIMP-1 i„ colorectal mmfri.enesis the 

hlrSl^ r C n breCtal adenocare ~ TIMP-1 mRNA was detected by in s ru ' 
hybnd 1Za t IO a In all cases TIMP-1 expression was found in fibroblast-Iike cdb loSed at 

was seen m any of the cases in benign or malignant epithelial cells i n vascular celk nr 
smooth ^uscle cells. Comparison of sect.ons processed for TIMP-1 ,n s^S z^on 
w,m sections .mmunohistochemically stained with antibodies against TIMP-1 showed 
g^d coirelation between TIMP-1 mRNA and immunoreactivity Combining TIMP^ I in 

mRNA u J^, ™ A m "^fibroblasts but not in macrophages TIMP-l 
SSStTw£22 V ° f ■ .t— 8 - adenomaari in b^cases 
condusS, tLp T ,nfl ammation ^ the epithelial-stromal interface. In 

^ STT ,S 3 rare Went " benig " hUman 00,011 tissue but » Wghly 
expressed by myofibroblasts in association with invading colon cancer cells. 
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Localization of Tissue Inhibitor of Metaijoproteinases 1 (TIMP-1) in Human 
Colorectal Adenoma and Adenocarcinoma 
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2™'*^ "» e W | "l«tew» 1 CHMP-l) inhibits 

toe proteolytic activity of Matrix raetalloproteases and hereby 
prevents cancer invasion. However, TIMP-1 also posse^esoIhS 
Amctoms ^.rh a. inhihiti on^optosis^dnXn^^S 
transformation ami ci; m ..u»;^.r ^ ™ nau 8 naBt 



re^ c^c?^^ 1 J 1 ^" 1 * i» blood fr^, P X 

•rectal cancer patients and that high TIMP-1 .levels oredict iZr 

easjhe expression patten, of TIMP-1 in benign and malSt 
colorectal tumors was studied. In all of 24 rasesTf 
^en^cardnoma TTMP-, mRNA was deLled^ 

cetlo^.^ 

cells located at the invasive front but was seen only SDOradieallv in 
normal mucosa. No TIMP-1 mRNA was seen in aiVrf Se ^ £ 
benign or malignant epithelial cells, in vascular cX" smooth 
n°v^H C ^ ^"P^ 0 - of sections processed for^-lT^ 
hybridization with sections immunohistocbemically stained "with 

T^^A?'.™"* 1 Sh ° Wed correla^btCn 
h3»u- ?i .mummoreactivity. Combining TIMP-l /„ situ 

hybndizafaon with immunohistochemical staS for a-smVSh 

buTiotln^r ° I™ Sba ^ TIMP - 1 i» myofibrSs 

but not m macrophages. TIMP-1 mRNA was detected in 2 of7 



ftTSSSS 1 5 ^ bybritotiM: ~^em- 

' hAS rtre<JUi f Site fW CaDCer Ce " invasioD ""astasis is the 
breakdown of tissue barriers mediated by proteolytic enles 
such as the matrix metalloproteinases (MMP).u 'Under ^omS 

tea^temtr^ ^ *"* de S rad W activities <TZ 
MAM* are kept at bay by the presence of (he naturally occuirin. 

a 28 10>a glycoprotein demonstrated to be present in most bodilv 
nssues and fluids, binds and inhibits MMPsm a i l stoToS 

h^Z' ereXprCSSi0n 0fTIMP - 1 in «rio« cancer S 
has shown a suppress.ve ,<fc in Ae malignant profession 
ever, as opposed to this ami-invasive role of fST'seve^I 
recent studies have demonstrated quite different fusions of tis 
MMP-mhibitor including stimulation of ceil growth roaWant 
transformation and inhibition of apop.osis, su^esC^ pos^Mc 

nf., '. . . 35 been ^""'ated that TIMP- 1 may actually 
play a dual role in cancer progression and metastasis™: * 

M^p e m^NA d in f f haVe , dem ~ ed wmor tissue levek of 
™no„ Pr °' em are s 'Snificantly increased in various 

mahonant diseases and that such .MMP elevations are correfeied 
w,d . cancer eel! invasion, me.astas.s and shon pa£„ sU^P^ 

TIMP-1 mRNA and prote.n .n several cancer types."-" Mor ° 
over, we and others have demonstrated that measurement of 'fa 
creased plasma levels of TW-1 by immunoass^ T^L as a 
suong marke, for shon survival and recurrence of d7setse in 
pafems wim colonial cance,— Sim i larIy , a strong co reTauo" 



between high protein levels and poor prognosis is known for the 
rype-I plasminogen activator inhibitor (PAI-1)>^5 Consider™* 
the pro tease inhibiting fu nction of these inhibitors, these findin J 
F^umuoyersial; bowevef, alternative functions ha™ been 
SwS. 1 ^ **. ™ P ' 1 as ^^ed above M well as for 

In order to better understand the role of TI\ff-l in colorectal 
cancer, biochemical analyses may provide some indicabWA 
number of studies of the localization of TIMP-1 in colorectal 
cancer have been published; however, the results of these S» 

TIMP-1 mRNA was expressed both in invasive adenccarcinoma. 
J™""* ^ ad ^nta and that the expression w£T 
served m bom die stromal as well as the epimehaf^mr^nent ^ 

T J!r ~! mR ^ A 7 as M P ressed 01 >)y in the stromal compart- 
ment of colorectal adenocarcinomas in spindle-shaped cells sur- 
rounding die mvasive cancer cells. The results of LnSoni 
cbemtcal studies of TIMP-1 in colon are also conflicting: S 
and colleagues" reported that . TIMP-1 was expressed in me oTn 
nective tissue and basement membrane in bod, normal mucosa 
adenomas and adenocarcinomas with only KtUe staining of the' 
neoplastic 1 epiftehum. On the other hand, Tomita and colleagues™ 
reported that TIMP-1. was expressed in both stromal and "Sal 
cells in colonic po yps and adenomas, as well as in adenocarcino- 
mas, in wh,ch the neoplastic cells were strongly immunoreactive 
In order toresolve these inconsistencies, we undertook our study 

s^,^,r a /-?rt b p y ^ dl?ati0n aDd ^"obistochemistrv demon- 
strand that TIMP-] is expressed in myofibroblasts in the stroma at 
the invasive front of colorectal adenocarcinomas. Because TIMP-1 
was vmually absent from normal colorectal epithelium. weTal- 
dfftl^ , P ° SS , 1>lh,y , 0f Using ^T- 1 as ' Vaostic tool to 

^ C0,0reC,al adeDOmaS from I***' sla I e A colorectal 
adenocarcinomas. . 



Materia] and methods " 
Tissue samples 

r,;,^' S u e J° a,eri , a i. iDC,uded was ob ^ ined hom University Hos- 
p.tal of Hv,dovre (Copenhagen, Denmark) in accordance with a 

OlTS l' Ve " by thC l0CaJ Scienlific ethica l committee (KF 
if n' n" neen f cKvaI Sara P ,es tf«wnaBB fixed and paraf/ii, 
Sfa^ H CO,,ecIedfrom 198 9 » 1993 included Dukes" stage A 
co orectal adenocarcinomas (« = g) and colorectal adenomatous 
P°l)ps In — 6, j were pedunculated (1 with mild and 2 with 
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moderate dysplasia) and 3 were sessile (1 with moderate and 2 
with focally severe dysplasia)]. Samples from 16 colorectal ade- 
nocarcinomas (1 Dukes* stage A, 6 Dukes* stage B, 8 Dukes', stage 
C and I Dukes* stage D), 1 villous adenoma and 1 malignant colon 
lymphoma were prospectively collected during 1999-2000. These 
prospectively collected tissue specimens were dissected so that 
samples contained both normal mucosa, and tumor tissue and were 
obtained within 30 min following surgical bowel resection. The 
specimens were immediately fixed in 4% neutral buffered formalin 
for 20-24 hr and then paraffin embedded. The 14 archival samples 
had also been formalin fixed and paraffin embedded. * 

Generation of honoverlapping TIMP-1 cP/M fragments by PCR 
The full length TIMP-1 cDNA (GenBank NM _003254) cloned 
in pSP64 vector 29 was used as template to generate 2 nonoverlap- 
ping PGR fragments for in vitro transcription, and named f 104 (bp 
56-378) and f 106 (bp 398-680). First, the whole insert (-780 bp) 
was cut out by digestion with Hind 111 ana Hamttl and punned 
after agarose gel electrophoresis using the Qiaex II gel extraction 
kit (Qiagen, Crawley, pnited Kingdom). To generate nonoverlap- 
ping antisense probes and the corresponding sense probe, 2 PCR 
fragments were generated using, upstream" primers flanked by a 
linker sequence containing an EcoRl restriction enzyme site (un- 
derlined nucleotides) and a T3 polymerase binding sequence 
(boldface) 5 '- (gaeaatt cattaaccctcactaaagggaga)-3' , and down- 
stream primers flanked by a linker sequence containing a BamVR 
restriction enzyme site and a T7 polymerase binding sequence 
5 Vgptcc taatacgactcactatagggag)-3' 9. The TIMP- 1 specific up- 
stream primers were 5'- acccaccatggccccctttg -3' for fl04 and 5'- 
(linker) - gcaggatggactetlgcaca -3' for ft 06, and the downstream 
primers were 5'- linker- actcctcgctgcggttgtgg -3' for f!04 and 5' - 
(linker) - tatctgggaccgcagggact -3.'. for fl06. PCR using the 2 fl 04 
primers of the 2 f 106 primers was done as previously described. 30 
The PCR products were purified by column chromatography 
using S-200HR microspin columns (Amersbam Pharmacia Bio- 
tech, Inc., Piscatway, NJ), and their size tested by agarose gel 
electrophoresis. Both migrated as —300 bp fragments in accor- 
dance with the predicted size (322 and 282 bp, respectively). An 
ABI. PRISM 310 genetic analyzer was employed for DNA se- 
quencing analysis and was performed according to the manufac- 
turer's instructions (Perkin Elmer, Applied Biosystems, Foster. 
City, CA) using the primers specified above. The DNA sequences 
obtained were confirmed by comparison with the specific TIMP-1 
cDNA nucleic acid sequence (GenBank NM_003254). 

Plasmids containing human MMP-2 cDNA (pCol720 1 , bp 647- 
1284) and human MMP-9 cDNA ( P Col9202, bp 1751-2326) have 
been described elsewhere. 31 

In vitro transcription 

Antisense and sense riboprobes were labeled with 55 S VTP 
(NEN, Boston, MA) by in vitro transcription using T7 and T3 
RNA polymerases (Roche, Base], Switzerland). The DNA tem- 
plate was digested with DNase (Promega, Madison, WT). Nonin- 
corporated 3:> S UTP and DNA was removed by column cinema- 
tography using S-200HR microspin columns (Amersham 
Pharmacia Biotech, Inc., Piscatway, NJ). The 35 S activity was 
adjusted for every probe by dilution to 500,000 cpm/pJ. 

In situ hybridization 

In situ hybridization was performed essentially as described 
previously. 32 hi brief, 3 u.m paraffin sections were- deparaffinized 
in xylene, hydrated with graded ethanol and boiled in a microwave 
oven for 10-12 min in 10 mM citrate buffer, pH 5.0, After 
additional 20 min at room temperature, the sections were dehy- 
drated will) graded ethanol and die j5 S labeled probes (2X 10 6 cpm 
in 20 hybridization mixture 3 "' per slide) incubated overnight at 
55°C in a humidified chamber. Sections were washed in Ilellen- 
dahl chambers wiih SSC buffeis containing 0.1% SDS and 10 mM 
DTr at 150 rpm at 55°C using a Buhler incubation shaker (Jo- 
hanna Otto GmbH, Hechingen, Germany) for 10 min hi 2XSSC, 



10 min in 0.5 X SSC, and 10 min in 0.2XSSC. Sections were then 
RNase A treated for 10 min to remove nonspecifically bound 
riboprobe. Subsequent wash' was performed in 0.2XSSC as spec- 
ified above. Sections were dehydrated and soaked" into an autora- • 
diographic emulsion (Of ord), exposed for 5r-7 days if not otherwise 
stated and finally developed. Sections were counterstained with 
haematoxylin and eosin. 

Jmmunoperoxidase staining 

ImmuriohistCK;hemistry was performed essentially- as described 
previously.^ Five micrometer paraffin sections were deparaf- 
finized with xylene and hydrated through ethanol/water dilutions. 
Tissue pretreatment was performed with protease-K (5 p-g/ml) 
digestion for 20 miri. Sections were blocked for endogenous per- 
oxidase activity by treatment with 19b hydrogen peroxide for 
15 min. The sections were washed in 50 mM Tris 150 mM NaQ, 
pH 7.6, containing 05% Triton 'X-lOO (TBS-T). Incubation with 

-a^r^edSes^was-done oyernight at4°C. Sheep polyclonal antibodies 
(pAb) against TTNflM and nonimmune goal IgG were used at a 
final concentration of 4.0 p-g/ml. Two monoclonal antibodies 
(MAb) against TIMP-1 , 33 NM4 {clone rnX6A, NeoMarkers* Fre- 
mont, CA) and CalB2 (clone 147-6D11, CalBiohem, Oncogene 
Res. Products, Cambridge, MA), and a JylAb against trinitro- 
phenyl (TNP) 3 * were all incubated at 1.0 p.g/ml (all 3 MAbs are 
lgGl). CalB2 MAb recognizes both free TIMP-1 and TTMP-1 
in complex with MMPs. 33 NM4 MAb only recognizes free 
TIMP-1. 33 According to the manufacturer's descriptions, both 
MAbs are raised using recombinant human TTMP-h The- sheep 

. polyclonal antibodies were raised by immunization with TIMP-1 
purified from human dermal fibroblasts. The IgG was obtained by 
triple precipitation nsing ammonium-sulfate and characterized by 
immuncKhffusion and rocket inmiun electrophoresis. 35 In adchV 
tion, we have shown that the pAb recognize both free and MMP- 
complexed TTMP-1. 36 Furthermore, the specificity of the antibod- 
ies was analyzed by Western blotting analysis against recombinant 
human TIMP-1 expressed in NSO mouse myeloma cells. Here, the 
antibody preparation recognizes a band of approximately 28 kDa 
in accordance with the molecular weight of TTMP-1. To certify 
that the pAb recognize TTMP-1 in colon rumors, the antibodies 
were immobilized on a sepharose column. Total protein extracted 
from 3 colon adenocarcinomas was passed through the column 5 
times and the bound and subsequently eluted protein analyzed in a 
Western blot using a TTMP-1 monoclonal antibody (MAC 15). A 
single band of approximately 28 kDa was revealed in accordance 
with the molecular weight of TTMP-1 (results not shown). In 
immunohistochemistry, the sheep pAb were detected with biotin- 
ylated rabbit-anti-goat IgG, which cross-react with sheep IgG 
(1:100, code E466, DakoCytomation} followed by horseradish 
peroxidase in complex with streptavidin (code K317, DakoCyto- 
mation). The MAbs were detected with the Envision-mouse re- 
agent (EnVision reagent, K4003, DakoCytoraation), followed by 
tyramine amplification, using biolinyl tyramine substrate as-spec- 
ified by the manufacturer (Nen, Boston, MA). Sections were - 
developed with NoyaRed substrate as specified by the manufac- 
turer (Vector Laboratories, Burlingname, CA) for 15 min. Finally, 
sections were counterstained in Mayers haematoxylin, dehydrated 
in ethanol and mounted. 

Combined in situ hybridization and immunohistochemistry 

Double labeling by combining in situ hybridization and immu- 
nohistochemistry on paraffin sections has been described previ- 
ously 32 In brief, using MAb against a-sm-actin (clone 1A4) 
diluted 1:1000. against cytokeratin (clone AE1/AE3) diluted 
1:1000, or against CD68 (clone PGM1) diluted l:200 r sections 
were incubated for 2 hours at room temperature and then detected 
with ant i-mouse-lgG/horse radish peroxidase- conjugated polymers 
(Envision-mouse reagent, DakoCytomation, Glostrup. Denmark). 
Sections were developed with diaminobenzidine (DAB) for 
7-10 min, and immediately dehydrated for in situ hybridization, 
which was performed as described above using the antisense 
probes of H04. Sections were counterstained with haematoxylin. 
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Results 

Analysis of TIMP-1 probes and antibodies for in situ 
hybridization and immunoMstochemistry 

Histopathologic*] diagnosis of prospectively collected speci- 
mens from 18 colorectal lesions revealed 16 colorectal adenocar- 
cinomas, 1 villous adenoma and 1 malignant lymphoma. ^S- 
labeled anusense and sense RNA probes were generated by in vitro 
™^ Ptl ™? m 2 n0nover,a PPin& DNA sequences of the human. 
TIMP- 1 cDNA and tested by in situ hybridization on adjacent 
sectors from 5 of the colorectal adenocarcinomas. The 2 antisense 
probes showed an identical hybridization pattern in all the 5 cases, 
Jocated in the stroma] compartment surrounding the invading 
cancer cells, while no specific signal was seen with the 2 sense 

S K^k 7 ^ WhCther ™ P - 1 :mRNA was accom - 
parned by TIMP-1 protem expression, imrnunohisiochemistry was 

performed on 8 of the adenocarcinomas (including the 5 men-' 



mnm pojycJpnal antibodjeToli i£5on? adjacent to 

JIMP-1 in situ hybridized sections. The TIMP-1 mRNA and 
immunoreacn^ity was observed in the same cells in aO of the 9 

*ntf™p i °T ^\ inchldin S lymphoma. The 

anti-TO4P-] polyclonal antibodies did not react with other ceil 
populations m all of 8 adenocarcinomas and the malignant lym- 
phoma apart from some normal and malignant epithelial cells that 
WCre 7™L ° D * e luminaJ a P ical * urfa <*- Two MAbs 

canon but showed a staining partem similar to that of the poly- 
clonal antibody preparation (Fig. 2B\ with the only exceptional 
neither: of the 2 MAbs stained the luminal apical surface of the 
normal and malignant epifoelium: No signal was obtained with 

XTS UnC »T ^ 01 3 mb < of same s " bc ^s as Calb2 and . 
NM4) directed against the synthetic hapten trinitrophenyl (TNP). 

TIMP- J mRNA expression patterns in colon cancer 

ina E S° n t °, f ™ P -\ mRNA was men in the remain- 

in S 9. colorectal lesions by in situ hybridization. TEMP-] mRNA 




cJl 1G fi URE "ll 7 " i'^ h >' bridi * at >°n with 2 nonoverlappiuo TIMP-] 
specific probes in-human coJon cancer. Tnree adjacent Actions Z a 

labeled a.uisense probes for TIMP-1 mRNA [fJ06 (^) moT* 
and la correspondmg TIMP-] sense pxobe, f 06 ( ) T^e n Z hv 

^ ™W ^5 k W ItC f 8lton in darkfield i'^ination (M>. 
twhriV™ P ?^ eS Sh0w lhe same hybridization partem and the 

,h?™D , P n? T 6c SIgnaI 15 seen with ,he ssi >se probe (c) Note that 

tivipT S . t e ,HVasJVe " DCCr «'k (C3) that are devoid of 
TIMP-1 m s„u hybndiaiion signal, ^d): Bar ■="] 00 ^ 



expression was in all the cases of colon adenocarcinoma (includina 
those mentioned above) highly expressed in stromal fibroblast-IiVp 
cells located at the invasive front (Fig. 3a,<[). TIMP-1 mRNA 
signal was also observed in fibroblast-like cells located in nC 
tumor stroma towards the colonic lumen in 8 of 10 cases whem 
tins tissue s*ucture was present (data not shown). No or little 

rf't \f^l A d£teCted w centI ^P^ of the carcinomas. 

colorectal adenocarcinomas, we observed TIMP 1 
mRNA Signal in some fibroblast-like cells located around the 
muscle layer, of some arteries located in the submucosa distant 
from the cancer area. The normal colonic mucosa, includino the 
lamina propria that- was present in all samples tested, was "ene^alhv 
negative (Fig. 3t,e)..Only a relatively weak TIMP-1 mRNA simal 
was detected in stromal fibroblast-like cells surrounding one or a 
very few normal crypts (Fig. 3c, j) in 3 out of 6 cases tested with 
extended exposure time (10 days vs. usually 5 days). In the villous 
J^ n ^ t ^l*™™ ] y a few ™P-* mRNA positive cells asso- 
rted wxtfa-fbr^^^ation (data not shown). In the malignant 
ryrnphoma of the colon, TDvIP-1 mRNA expressing fibroblast-like 
cells were, different from the adenocarcinomas, located in a dif- 
fuse pattern throughout the whole tumor. No TIMP-l mRNA 
^ . Was ™ ™y of the 18 cases in the cancer cells, 

smooth muscle cells or vascular cells. - . 

Characterization of TIMP-1 mRNA expressing cells 

cZ°A Y^^P** 1 mRNA fibroblast-like cells 

could be (myo)fibroblasts and/or macrophages, sections from 4 
colorectal adenocarcinomas and the malignant lymphoma were 
hrspmmunohistochemically stained with antibodies directed 

^T^^^^Sr^^ - f ^ ofib ^t/smooth mnscle 
cells (SMC)] or CD68 (for detection of macrophages) and subse- 
quently incubated with a TIMP-1 mRNA aitisense pr<Z Z 
normal colon ussue, a-sm-actin is expressed by vascular smooth 
muscle cells, smooth muscle cells of lamina muscularis mucosae 
and tunica muscularis as well as pericrypta] myofibroblasts " In 
colon rumors, a-sm-actin is expressed by tumor- associated fibro- 
blast-hke cells located throughout the tumor stroma, which are 
defined as myofibroblasts. No TIMP-1 mRNA was detected in any 
o>sm-acun positive smooth muscle cells, including those of the 
vessels, the lamina muscularis mucosae and the tunica muscularis 
In addmon, no TIMP-1 mRNA was detected in the a-sm-actin 
positive pencryptal myofibroblasts of the lamina propria in any of . 
the 5 lesions. TIMP-j mRNA signal was in contrast seen in 
a-sm-actm-positi ve tumor associated myofibroblasts located at the 
invasive front of the colon cancers. In 3 of the adenocarcinomas, 
more than 80% of TTMP-I mRNA-positive cells located close to 
J £i CanCCr Cells werc a-sm-actin-positive (Fig. 4). 
TIMP-1 mRNA positive fibroblast-like cells located more distant 
from the invasive cancer cells, towards the submucosa, expressed 
little or no a-sm-actin. In 1 adenocarcinoma and in the malignant 
lymphoma approximately 50% of the TIMP1 mRNA positive cells - 
expressed a-sm-actin. Thus, the TIMP-1 mRNA expressing cells 
constirute a subpopularion of tumor-associated myofibroblasts lo- 
cated at the invasive front of the tumor. No TIMP-1 mRNA signal 
could be identified in any of the CD68-positive ceJIs (Fig. 4). 

Expression of TIMP-J and MMP-2 and 9 in colon cancer 

MMP-2 and MMP-9 are 2 type IV co]]ao enases expressed in the 

M^p V o e » C T er ' iSSUe ° S colorectal adenocarcinomas. Thus 
MMP-2 has been reported (o be expressed by fibroblast-like cells 
m the cancer stroma.""* and MMP-9 by macrophages at the 
leading edge of the invasive cancer » To directlv compare the 
expression patterns of MMP-2 and MMP-9 with that of TIMP-1 
adjacent sections from 5 colorectal adenocarcinomas werc hybrid- 
ized with probes for TIMP- J, MMP-2 and. MMP-9 mRNAs We 
found tliat the expression of TIMP- 1 mRNA was localized char- 
acteristically at the invasive front of the o JO win ? tumor, whereas 
lhe expression of MMP-2 mRNA was most intense in the central 
a.eas : S /io wing decreased expression towards die invasive from 
(hg. da). MMP-9 mRNA expressing cells were " found nt the 
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i hybridization and hnimmohistocbemistry for TIMP-1 in human colon cancer. (/i) Two adjacent sections from a human colon 
ere incubated with polyclonal antibodies against TIMP-1 (a,c) and a TIMP-1 mRNA antisense probe (b). The HMP-1 



Figure 2 -Jn situ I 

adenocarcinoma were incubated with polyclonal . .._ _ 

immunoreactivity (red-brown color, arrows in a and c) and lheTBVJP-1 mRNA (silver grains, anows in b) are identified in the same cells (arrows 
in a.b). Immunoperoxidase staining with ihe TIMP-1 pAb reveals the T1MP-1 -positive cells as fibroblast- like cells (arrows in c) located in the 
stroma (St)- No TJMP-I immunoreactivity is seen in cancer cells (Ca). a. b: bars = 50 urn; c: bars =13 \tm. {B) Four consecutive adjacent 
sections were incubated with CalB2 MAb anti-TlMP-] (a), NM4 MAb {b\ sheep ami TIMP-1 pAb (c) or mouse ami TNP (d). The 3 MAbs 
were detected with Envision Te agent followed by TS amplification and the sheep pAb with bio tiny la ted rabbit anti-goal followed by 
HRP-conjugated stfeptavidin (see Material and methods). The 3 TIMP-1 antibodies react with the same cells (arrows). No immunoieactivity is 
seeD when the sections are incubated wiih ami- TNP. 



invasive front like those expressing TIMP-1 mRNA but with a 
distinctly different distribution. Foci with high expression of 
TIMP-1 mRNA were not accompanied with increased expression 
of MMP-9 mRNA and vice-versa (Fig. 5b). Thus, TIMP-1 mRNA 
expression is not coregulated with MMP-2 or MMP-9 mRNA 
expression. 

T1MP1 in adenomas and Dukes' srage A carcinomas 

TJMP-J antigen can readily be measured in blood and we have 
previously reported that levels of T1MP- 1 in blood are significantly 



elevated in colorectal cancer patients compared to healthy donors 
and that high plasma TIMP-1 levels are associated with short 
survival of colorectal cancer patients. 2 '- 36 TIMP-1 has therefore 
been suggested to be a novel marker for detection of early stage 
colorectal cancer and for prognostic stratification- of colorectal 
cancer paiients. ?) - 39 These findings, together with the characteristic 
expression pattern of TIMP-I afthe invasive front of virtually all 
the colon cancers and the absence or minute TIMP-1 expression m 
normal arid benign colon mucosa, prompted the evaluation of 
TIMP-1 expression as a marker for early invasive colon cancer. 
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^'Sfc^^ »^™" colon cancer. Sections were firs, processed for immuno- 

£S£ u" Se,,S ! , pr0be " ^f" 1 mRN A sisnal s co^ a ] zed w S^sm -.^ SUbse< > n<in, ». b 7 » ™ hybridization tL* L f 10* toS? -1 

TTMpI Una !• CD ffP? s ' mc '"acrqphaoK (red arrows in b) Note s^nie of rh^^i ^ A P ° S '" Ve CeUs (black aiTOWS in >» «■ 
I JMP-J mRNA sional (red arrows in o). Bars = 13 ^.m. he «- sm - a "in-pos.tive myofibroblasts have little or no 



Th efore we compared 71MP-1 mRNA expression.;,, m addi- 

Z onal 8 £1 ? de "° mat0US P 0 ^ *e expression in an 
addmonal 8 Dukes s,o»e A colorectal adenocarrinomas. JIMP- 1 

Duke* s aoe A caronomas showed T)MP-| mRNA sio na l at the 

ST XT' 5 ' ,n ' P0S " iVe ade "°"' a (Pe^nciated pe) 
II m r? «P re -«>or. was confined to a single focus in 

hi i!! ?h . ' l " S,0) ?? ,cal an! ">' si s of additional sections from 
h. sample clearly revealed disruption of the dysplastic epithelium 
n hesame area^ln the other TJMP- , mRNA positive adenoma 
oc, cd V^V T1MP -' mRNA «P ressi "? fibroblasts were 
ETSST snows' d ' reC,,y - ^ 



he nl r m ?^ A ex P' css ' on in myofibroblasts located a. 

„„. !T, ! ° f !bC nmoK - Wbereas «P re ^°» ««s detected 
celk n«L , ^ K n f S> . and in ' hese W3Sseen in fibroblast-like 
>mlr(T^T ,T h focal lnfl; »»™"°n at tbe epitbeljal-s.romal 
.merface ,n 2 of the cases and with arteries in the sttbmucosa in I 



Discussion 

^Z ^' Tf m » d /> n ^ en '° Clnrify ,he ^ssion and cellular 
^ °> MM , P «*iW>or TIM PI io human colon ade- 

^ dj f We ' e '°" Dded °" ,he use ol 2 specific 
annsense RNA probes dertveel from 2 nonoverjnppin S 'JTMP-l 
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Figure 5 - In situ hybridization for TTMP-1 , MMPr2 and MMP-9 in 
human colon cancer. (A) Adjacent sections were incubated with probes 
specific for TIMP-1 mRNA {a,c) and MMP-2 mRNA (b, d), respec- 
tively, and is showu in brigbtfield (a,b) and darkfidd illumination 
{c.d). The TTMP-1 mRNA signal increases towards the submucosa 
(Sm) whereas the MMP-2 mRNA signal decreases and is most intense 
in the central areas (Ca). {B) Adjacent sections were incubated with 
probes specific for TIMP-1 mRNA (a,c) and MMP-9 mRNA (b,d), 
respectively, and is here shown in brightfieJd ia.b) and darkfield 
illumination ic,d). Both the TIMP-1 mRNA signal and the MMP-9 
mRNA signal are most intense at the invasive front towards the 
submucosa (Sm), but their expression panems are quite different, with 
MMP-9 showing die most restricted expression. Bars == 100 u.m. 

cDNA fragments and specific pAb and MAbs againsi human 
TIMP-1. The TIMP-1 mRNA signal in all colorectal adenocarci- 
nomas investigated was seen in fibroblast-like cells located in the 
tumor periphery. An identical hybridization pattern was observed 
with the 2 antisense TJMP-1 probes and application of comple- 
mentary sense probes on neighboring tissue sections as negative 
controls did not result in any hybridization signal; therefore, we 
conclude that the hybridization signal generated with the antisense 
probes represents the genuine TIMP-1 mRNA. TIMP-1 inimuno- 
rcaciivity w;is also distinctly located in fibroblast-like stromal cells 
»n_the_iumor periphery, and these cells were identified to be the 
same cells as the TJMP-1 mRNA expressing cells. A pieparation 



of sheep pAb against human TUVffVl 35 and 2 well- characterized 
MAbs stained the very same cells in the tumor stroma. Weak 
staining of the apical surface of some normal and malignant 
epithelial cells was observed with the pAb in some of the samples; - 
No staining was obtained when roe^anti-TIMP-l antibodies were 
substituted with nonimmune goat serum or anti-TNP MAb incu- 
bated at the saine concentrations. These mimunohistocheinical 
findings strongly suggest that th& TTMP-1 antigen detected in the 
fibroblast-like cells represents the genuine TTMP-1 protein. 

In our study, we found TTMP-1 mRNA expression in stromal 
fibroblast-like cells located in the tumor periphery in all colorectal 
adenocarcinomas tested, whereas no expression was detected in. 
the. cancer cells in. any -of the cases tested." This finding is in 
agreement with studies by Zeng and colleagues, 12 ^ 0 but is partly in 
disagreement with findings by Newell and ; colleagues. 27 In addi- 
tion to TTMP-1 mRNA signal in fibroblast-like cells in the tumor 
i4>eriphef y r Newe ll and colleagues 27 detected a weak TTMP-1 
mRNA signal in both benign and malignalt epithelial cells. 27 This 
observation was, however, based on the use of probes- from a 
. single TTMP-1 cDNA subclone and no additional controls- to verify 
the expression pattern. The difference between our results and 
those of Newell and colleagues may be explained by methodolog- 
ical differences, since the procedure employed, by Newell and 
colleagues was in several steps different from the one used in the 
present study* Newell and colleagues used 3 ^ : labeled probes, 
whereas we used 35 s -labeled probes. It is in this context notewor- 
thy that in order to look for a low expression level of TTMP-1 
mRNA, we performed in situ hybridization experiments with pro- 
longed exposure time (10 days vs. usually 5 days) with both our 
TTMP-1 antisense probes and both TTMP-1 sense probes, but with 
this challenge we did not detect any TTMP-1 mRNA in any 
epithelial cells. It cannot be excluded though that the TIMP-1 
mRNA is expressed in epithelial cells below the detection limit of 
our in situ hybridization procedure. 

An interesting observation in our study was the characteristic 
intense TTMP-1 mRNA and protein expression in the tumor pe- 
riphery of all colon adenocarcinomas, while little or no expression , 
was seen in the center of the carcinomas. Only in the colorectal 
lymphoma did we find TTMP-1 mRNA and protein expression in 
fibroblast-like cells located throughout the tumor tissue. The 
TTMP-1 expression pattern in the colon adenocarcinomas is in 
contrast to the expression pattern reported by Hewitt and col- 
leagues, 19 who found that the TIMP-T staining in most colorectal 
adenocarcinomas was equally intense in fibroblasts throughout the 
tumors and that some of the cases even showed decreased TTMP-1 
signal intensity towards the tumor periphery. This difference may 
be explained by possible cross-reactivity of the polyclonal anti- 
bodies employed by Hewitt and colleagues or that Hewitt and 
colleagues employed cryostat sections, while we analyzed paraffin 
sections. * . 

The Tl.MP-1 expressing cells had a fibroblast-like morphology . 
and using combined in situ hybridization for TTMP-1 mRNA and 
immunohistochemisuy for a-sm-actin, we found that many, gen- 
erally more than 50%, of the TTMP-1 mRNA positive cells coex- 
pressed a-sm-actin. According to the cellular morphology of the 
T1MP-T expressing cells and their localization in the invasive 
front, we could conclude Uiat die cells were myofibroblasts and not 
smooth muscle cells. 

, The myofibroblast is a cell type present in the normal colon 
mucosa, that originally was described as a pericryptal fibro- 
blast 3 ^ 0 and later was identified with antibodies against a-sm- 
actin. 37 In the lamina propria, the myofibroblasts form a continu- 
ous cell layer just below the intestinal epithelium. The pen crypt 31 
myofibroblasts are phenotypically different from the neighboring 
quiescent interstitial fibroblasts that do not express markers ot 
smooth muscle cells/" During early steps of colonic lumori genesis 
the number of myofibroblasts is significantly increased. 111 The 
TT MP-3 expressing myofibroblasts may be generated after activa- 
tion of the pericryptal myofibroblasts and/or the quiescent inter- 
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stmal fibroblasts. Adegboyega and colleagues 4 ' hypothesized that 
the tumor-associated myofibroblasts originate from the quiescent 
interstitial fibroblasts of the lamina propria, rather iban from peri- 
cryptal myofibroblast or smooth muscle cells, which may help to 
explain why we found some of the TIMP-1 mRNA expressing 
fibroblast-hke cells a-sm-actin-positive and some a-sm-actin-neo! 
aove. . ^ 

Several MMPs including MMP-2, MMP-I1 and MMP- 14 are 
expressed by fibroblast-like cells in human colon cancer 2731,42-44 
some of which may indeed be myofibroblasts. The role of the (myo-) 
fibroblasts in colon cancer progression is not known. Since TIMP- 1 in 
human colon cancer appears only to be expressed by fibroblast -like 
cells most of which are myofibroblasts, and since high TIMP-1 levels 
measured in blood or rumor extracts from colon cancer patients are 
strongly associated with a poor prognosis? 0 - 21 ; t COuld ^ d 
that the TIMP-1 expressing myofibroblasts play a tumor-promotin° 
role Immunohistochemical localization studies of proteins involved 
in the aenvauon and regulation of the efficient serine protease plas- 
minogen, including urokinase plasminogen activator (uPA) and its 
specific inhibitor PAH show that both are mainly expressed by 
myofibroblasts in human breast cancer.-*-* Hieh levels of uPA and 
PAM are strongly correlated with poor prognosis in breast can- 
cer, supporting the assumption that the myofibroblast express a 
promoting role in cancer invasion. We recently reported thai the 
predominant PAM expressing cell in human colorectal cancer ulso is 

/rT^ 0 , Sl '' y 3nd Mrlier Studies indicaled lhaI ^vated levels 

01 lf \ C ° l0n C3ncer JMlients are associated with poor pro°no- 

sis.- Together these findings indicate thai myofibroblasts are sn on^lv 
conlnbutmg l0 , he expression of proteins involved in the reeuIationVf 
extracellular matrix degrading proteases that facilitate cancer invasion 
and metastasis. 

A particularly interesting finding of the present study was the 
absence of TIMP-]. mRNA in 4 of 7 adenomas, whereas in all of 
J Dukes stage A carcinomas the TIMP-1 mRNA was expressed in 
fibroblast- like celts along the invasive from, in the 2 benign lesions 
in which the TIMP-1 „,RNA was seen in the adenomata the" 



TIMP-1 mRNA positive cells were confined to a single focus with 
locally increased inflammation related to the dysplastic epithelium. 
Evident disruption of the dysplastic epithelium was observed in the 
adenoma with most intense TJMP-1 mRNA signal. Intestinal in- 
flammation may be caused by disruption of the mucous epithelium 
that leads to focal leakage of mucinous colon material into the 
lamina propria. Increased intestinal permeability is a common 
deficiency in Crohn* s disease and interestingly TIMP-1 mRNA 
was found in the intestinal granulation tissue of Crohn's disease 50 
and is expressed by myofibroblasts isolated from Crohn's dis- 
ease Induction of TIMP-1 in myofibroblasts in a benisn or 
preinvasive tumor may also be a response to locally increased 
MMP activity or a response to the presence of a specific MMP in 
the local microenvironment. MMP-2 and MMP-9 mRNA expres- 
sion, however, did not appear to be cpregulated with TIMP-1 
mRNA expression in the colorectal adenocarcinomas. Specific 
MMPs .may indeed be involved in the transition of noninvasive to 
invasive disease; in smdies of preinvasive lesions (ductal carcino- 
mas in situ) of the human breast we recently reported that MMP- 13 
is specifically expressed in myofibroblasts associated with micro- 
invasive events.- 2 Future studies may clarify whether 71MP-1 
expression in colorectal adenomas is correlated with expression of 
specific MMPs, cytokines and/or growth factors, such as TGF-pl 
and TGF-p25l, and whether TIMP- Lean be used as a hislopalho- 
logical marker for malignancy in colorectal tumors. 
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SKSESET activator """ ,ccd by dcug " e virus tafection ° f h °™» 

Dengue hemorrhagic fever and dengue shock syndrome (DHF/DSS) are severe 

celt 2i a hf B - * Pnmaiy iS ° ,ated end °™ <*"s, human umbilical cord veins 
cells, and a human microvascular endothelial cell line. DV infection sienificandv 

endothehal cells was induced by DV as demonstrated bv RT PPR a IS a t 5 

Furthermore, a good correlation between sera levels of IL-6 and tPA was found in DHF 
but not DF pafents. These results suggest that IL-6 can regulate dSSa 
production of endothelial cells, which may play important inThe ltho g nic 
development of DHF/DSS. Copyright 2003 Wiley-Liss, Inc. P am °8 enic 
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Neu oncogene expression in ovarian rumors: a quantitative study. 
fi«MtnerJ^;Ca™e^ 

Department of Pathology, Tufts University School of Medicme,Massachusetts. 

Spnfe 'eUsT^ ^r' 0 ? 7 Sl0t b,0t and P^einproduct expression by 

SESv^ 'n^unohistochemistry in 57 primary and metastatic ovarii Y 
neoplasms, two paraovarian leiomyosarcomas, and eight normal ovaries Some61% of 

*?v^t 5 - 0 /0 0{thG ovanan t" 001 * that overexpressed neu were of 
epntehal type. Epithelial ovarian tumors had significantly higher amounts^e neu 
oncogene product as determined by capture ELISA than eitheTgernT^^ Z ZZ 

m tSS ^ I ^ oncogene product as measured by capture 

T^* TZu ST? had Ae higheSt > P 00 ^ differentiated carcinomTnot 
etherise specfied had the lowest (p less than 0.025). ELISA values mRNA 

oppression, and immunohistochemical staining intensity did not correlate with staee ; 

tMSA is a simple, effective way to measure the neu oncogene protein product and that 
there , S ^a good correlation between ELISA. levels and immunohLochemical stamina . 
intensity. However, ELISA values did not correlate with stage or histoid proZfic 
factors m ovanan neoplasms. e piognosnc 

PMID: 1353878 [PubMed - indexed for MEDLINE] 
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Real-time quantitative RT-PCR of cyclin Dl mRNA in mantle cell 
lymphoma: comparison with FISH and immunohistochemistiy. 

guiP, HowcJG , Crouch J, Nimmakavalu M .. Qumsiveh MIS TalliniG Flynn SD 
Smith BR . ' — . ' 

Department of Laboratory Medicine, Yale University School of Medicine 333 Cedar 
Street, P.O. Box 208035, New Haven, CT 06520-8035, US A. ' 

Presence of the balanced translocation t(l I;14)(ql3 ;q32) and the consequent 
oppression of cyclin Dl found in mantle cell lymphoma (MCL) has been shown to be 
of important diagnostic value. Although many molecular and immunohistochemical 
approaches have been applied to analyze cyclin Dl status, correlative studies to compare 
different.met.hods for the diagnosis of MCL are lacking. In this study, we examined 39 
archived paraffin specimens from patients diagnosed with a variety of 
lymphoproliferative diseases including nine cases meeting morphologic and 
immunophenotypic criteria for MCL by (1) real-time quantitative RT-PCR to evaluate 
eye m Dl mRNA expression; (2) dual fluorescence in situ hybridization (FISH) to 
evaluate the t(l 1 ; 14) translocation in interphase nuclei; and (3) tissue array 
immunohistochemislry to evaluate the cyclin Dl protein level. Among the nine cases of 
possible MCL, seven cases showed overexpression of cyclin Dl mRNA (cyclin Dl 

SS5 V ri^w ° aSeS Sh0W6d n ° Cydin D 1 a ^ (A increase (°y c,i n negative 

MCL-hke ) In six of seven cyclin Dl positive cases; the t(l 1;14) translocation was 
demonstrated by FISH analysis; in one Case FISH was unsuccessful. Six of the seven 
cyclin Dl mRNA overexpressing cases showed increased cyclin Dl protein on tissue 
array rnimunohistochemistry; one was technically suboptimal. Among the two cyclin Dl 
negative MCL-hke cases, FISH confirmed the absence of the t(l 1 ; 14) translocation in 
both cases. All other lymphoproliferative diseases studied were found to have low or no 
cyclin Dl mRNA expression and were easily distinguishable from the cyclin Dl 
overexpressing MCLs by all three techniques. In addition, to confirming the need to 
assess cydra Dl status, as well as, morphology and immunophenotyping to establish the 
diagnosis of MCL, this study demonstrates good correlation and comparability between 
measure of cychn Dl mRNA, the 1 1;14 translocation and cyclin Dl protein. 
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activity n human tumors: demonstration of tumor-type specificity and 
implications for cancer chemotherapy. ""ciryand 

HusainJ, MohjgrJL, SeiglerHF, Besterman Jlvf 
: S:SS WOey ' *" «— * Triage P*. 

Topobome^e I has been identified as an inlraeellular la^et of cmpteineoin a nlane 

nevels are elevated mojer solid tumors, (b) the elevated enzyme is caiyTa y ™ ive 
n hese tumors, and (c) the increase in topoisomerase I levels rcolorectal tamo^r 
rault of increased transcription or translatioa Topoisomerase I levels were ™3Si in 

»unS!rt h w" 1 r ,0r ^ al - Pr ° State ' and their mSrraf " 

SSTEfl y bbttihg 311(1 by direCt detem unation of catalytic activity^ 

rumors snowed 5-3 5-fold increases in topoisomerase I levels, compared to their normal 
colon mucosa In the case of prostate tumors, the increase w^s 2-10-fold comTaSTwl 
bemgn hyperplastic prostate tissue from the same patients. However no dXen^e Was 

32p£X?" ^ f™* 0 ^ to P°' somerase I was detexmined by a quantitative 

32P-transfer assay m crude homogenates, without isolating nuclei. Colorectal and 
prostate himors exhibited 1 1-40- and 4-26-fold increases, fespect vely SI 

LI' ?■ uT*' hdn T tUm ° B did n0t sh0W "V alteration in cata&ica^vity 
compared o their normal matched samples. Thus, for all three tumor tvpeX e was a 

good correlation between enzyme levels and catalytic activity. Fina" y^recSlImors 

7Z d ZS£ State ,6Ve,S - A 2 " 33 - f ^ d ^ in nXTlevek wL 

found m colorectal tumors, compared to normal colon mucosa. These results surest that 
alterations in topoisomerase I expression in humans are tumor type spec fie andS the 
n rase in t0p0IS0merase , M ^ from ^ ** the 

topoisomerase I gene or increased mRNA stability, P 
PMID: 8275492 [PubMed - indexed for MEDLINE] 



♦ f 

mmunohistochemistiy and quantitative RT^!' the °*eome of patients. METHODS: 
in 43 archival specimens of^^^^^^^-f^ expression of pKi-67 
were not treated with neo-adjuvant ttcn^^^^ST^ Carcinoma ' wh ° 

JabeJmg mdex of 31.3% (range 10 3-66 4%\" W . We determined a median pKi-67 (MIB-1 ) 
range ,0.01-0.69); indices\J|^^ (DeltaC(T): ' 

assorted with a significantly &v«^^Sf^gSf? mRNA DeltaC(T) values were 
correlated to prognostic outcome. A bultivaSteink? *f ^ ^ ,mg mdi ° es were not 
indicated that tumor stage (UrCC) ari ^Tl nZf C1IUCaI and bioI <>gieal factors 
prognostic factors. CONCLUSION ^ S ,Si^V XP reSS J° n ,evel were ^dependent 
newprognosticindicatorforprima^^^ 
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Modulation of glucagon receptor expression and response in transfected 
human embryonic kidney cells. 

Ikegami T, Cypess AM . Bouscarel D 
• SSTc^^^ UniVC ^ ^ ^ Washington, 

The modulation of glucagon receptor (GR) expression and biological response was 
the^Rw^ 

the GR wtth dtfferent denotes. The GR mRNA expression level in these clones was 
upregu ated by cellular cAMP accumulation and printed a good Jn^nZZoto 
tto protem express™ level and the maximum number of glucagon binding^ 
However the determination of glueagon-induced cAMP accumulation ifthe e cell lines 
revealed that the^enhancement of receptor expression did not lead to a propina 
increase m CAMP formation. Under these conditions, the maximum cAMP production 
el U< ^ y "* f0r5k0Hn W3S n0t si ^antly Afferent among seated clones 
greatest suscept.b.Iity for agomst-mduced desensitization compared with clones with 
^r^ 

recruit non-GR-spec.fic desensU.2at.0n mechanism(s). Furthermore, the partial inhibition 
or alterahon of the overall cAMP synthesis pathway at the receptor evel may be7 

PMID: 11546678 [PubMed - indexed for MEDLINE] 
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Developmental regulation of acidic fibroblast growth factor (aFGF) 
expression in bovine retina. 

I^SMSmE, Jsm^onmJL, B^raK, LaurentM, CourtoisY, JeannvJC . 
Unite de Recherches Gerontologiques, U. 118 INSERM, Paris, France. 

vaSefv of SKST* faCt ° r F> iS ,f Si6naUing ra ° ,eCub im P licatcd in a ^de 
bZn l^ l & u P ^ SUCh 38 0611 differentiation and survival. It has 

been punfied from bovine return: The present study was carried out to detect which cells 
m the bovine retina expressed aFGF at the different stages of embryonic and 
development. The specific aFGF mRNA and protein were detected by inS ' 
hybndiza .on employing riboprobes and immunocytochemistry using affinity purified 
polyclonal human recombinant aFGF antibodies respectively^ signal Tw* ^detected bv 

™i ^T-, * h0g fr iS °5 the retina - By 8 ' 9 months of embryonic development 
nude, of the 3 neuronal layers (ganglion cell layer, inner and outer nuclear layeL were 
^uniformly and intensely labeled. A slight labeling of the pigmented epitheHum ofZ 

^cSS° T b H e dlr ° Ugh0Ut devel °P ment maturatiTThese rLts shbwed a 
good correlation between message and protein expression in these cell types. In contrast 
ghal cells m the nerve fiber layer and vascular endothelial cells displayed a nuclear 
irnmunos airung for the protein in the absence of message. These data suggest that aFGF 
mech^sms" **" differentiation b * autoc ™ and parTcrine 

PMID: 7507349 [PubMed - indexed for MEDLINE] 
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The p2I(Cipl) protein, a cyclin inhibitor, regulates the levels and the 
intracellular localization of CDC25A in mice regenerating livers. 

JaimeM, Pujol MJ , SerratosaJ, Pantoja C. Canela N. Casanovas O. Serrano M 
Agcll N. BachsO . ' 

Department of Cell Biology and Pathology, Faculty of Medicine, Instihit dlnvestigacions 
Bwmediques August Pi Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain. 

Liver cells from p21(Cipl-/-) mice subjected to partial hepatectomy (PH) progress into 
DNA synthesis faster than those from wild-type mice. These cells also show a premature 
induction of cyclin E/cyclin-dependent kinase (CDK) 2 activity. We studied the 
mechanisms whereby cells lacking p21(Cipl) showed a premature induction of this 
activity. Whereas the levels of CDK2, cyclin E, and P 27(Kipl) were similar in both wild- 
type and p2l(Cipl-/-) mice, those of the activator CDC25A were much higher in 
p21(Cipl-/-) quiescent and regenerating livers than in wild-type animals. Moreover, 
p21(Cipl-/-) cells also showed a premature translocation of CDC25A from cytoplasm 
into the. nucleus. The ectopic expression of p21(Cipl) into mice embryo fibroblasts from 
p21(Gipl-/-) mice decreased the levels of GDC25A and delayed its nuclear translocation 
The levels of CDC25A messenger RNA in p2l(Cipi-/-) cells were higher than in wild- 
type cells, suggesting that this, increase might be responsible, at least in part, for the high 
levels of CDC25A protein in these cells. Thus, the results reported here indicate that 
p21(Cipl) regulates the levels and the intracellular localization of CDC25A^ We also 
found a good correlation between CDC25A nuclear translocation and cyclin E/CDK2 
activation. In conclusion, premature translocation of CDC25A to the nucleus might be 
involved in the advanced induction of cyclin E/CDK2 activity and DNA replication in 
cells from animals lacking p21(Cipl). 
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Alteration of frizzled expression in renal cell carcinoma. 
JansscnsN, AadriesI^Jajn^^ 

Department of Biochemistry, University of Antwerp, Wilrijk, Belgium 
njansse9@prdbe.jnj.com B 

To evaluate the involvement of fiizzled receptors (Fzds) in oncogenesis, we investigated 
mRNA expression levels of several human Fzds in more than 30 different bSXr 

n ° mal tiSSUe S3m P ,es > -ngreTTe ° f 
TtfTU? W ^ ** *° leveI of Fzd5 was increased in 

8 of 1 1 renal carcmoma samples whilst Fzd8 mRNA was increased in 7 of 1 1 renal 
woman, samples. Western blot analysis of crude membrane fractions reva£ that 

Se^ 

2 Wnt/beta-catenm signaling pathway activation was confirmed by 
ft e increased express,on of a set of target genes. Using a kidney tumor tissue array Fzd5 
protem express.on was mvestigated in a broader panel of kidney tumor samplJTzdS 
membrane stammg was detected in 30% of clear cell carcinomas, and there w2 a stone 
correlate W1 th nuclear cyclin D I staining in the samples. Our data sug^ste7ma< Xd 

ZSrilr" T berS °f *f fami,y " aad downstrermTgltuld 
prov de alternative mechanisms leading to activation of the Wnt signaling pathway in 
renal carcinogenes.s. Fzd family members may have a role as a biomarker 

PMID: 15557753 [PubMed - indexed for MEDLINE] 
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Abstract 

To evaluate the involvement of frizzled receptors (Fzds) 
in oncogenesis, we investigated mRNA expression lev- 
. els of several human Fzds in more than 30 differ- 
ent human tumor samples and their corresponding 
Imatched) normal tissue samples,. using real-time quan- 
titative PCR. We observed that the mRNA level of Fzd5 
was markedly increased in 8 of 11 renal carcinoma sam- 
ples whilst Fzd8 mRNA was increased in 7 of 1 1 renal 
carcinoma samples. Western blot analysis of crude 
membrane fractions revealed that Fzd5 protein expres- 
sion in the matched tumor/normal kidney samples corre- 
lated with the observed mRNA level. Wnl/p-catenin sig- 
naling pathway activation was confirmed by the in- 
creased expression of a set of target genes. Using a kid- 
ney tumor tissue array, Fzd5 protein expression was 
investigated in a broader panel of kidney tumor samples. 
F2d5 membrane staining was detected in 30% of clear 
cell carcinomas, and there was a strong correlation with 
nuclear cyclin D1 staining in the samples. Our data sug- 
gested that altered expression of certain members of the 



Fzd family, and their downstream targets, could provide 
alternative mechanisms leading to activation of the Wnt 
signaling pathway in renal carcinogenesis. Fzd family 
members may have a role as a biomarker. 

'■ Copyright O 2004 S- Kir get AG, 8«d 



introduction 

The Wnt signaling pathway is evolutionary conserved 
and controls many events during embryonic develop- 
ment. Members of the Wnt gene family of secreted glyco- 
proteins are involved in embryonic induction, generation 
of cell polarity, cell proliferation and the determination of 
cell fate [1,2]. Recently, it has become evident that the 
Wnt pathway is also deregulated in a range of tumors [3]. 

The Wnt signaling pathway is activated when Wnt pro- 
teins bind to a ceJl surface receptor complex consisting of 
a member of the frizzled receptor (Fzd) family and either 
low-density-lipoprotein receptor-related protein (LRP)5 
or LRP6 [4, 5]. A detailed characterization of the Fzds 
and the immediate downstream events after Wnt binding 
has been hampered by the lack of pure biologically active 
Writs. 

Downstream of the receptor complex, three pathways 
may be initiated, depending on the composition of the 
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ligand and receptor complex The 'Wm/p-catenin path- 
way*, the 4 Wnt/Ca 2+ pathway' or the Wnt 'polarity path- 
way' [6). The Writ/P-catenin pathway has been iinked to 
carcinogenesis. Genetic alterations in components of this 
pathway {adenomatous polyposis coli, APC, axtn and P- 
catenin)can result in the accumulation of non-phosphory- 
lated p-catenin [3, 7] and this can promote carcinogenesis. 
Conversely, neither the Wnt/Ca 2+ pathway nor the Wnt 
polarity pathway involves the activation of P-catenin [for 
review, see ref. 1,6). 

Mutations in one of the three regulatory genes (APC, 
p-catenin and axin), overexpression of Wnts and Fzds or 
the expression of a constitutively active Fzd have been 
. linked to Wnl/P-catenin pathway activation in various 
tumors [8, 9], 

To evaluate the involvement of Fzds in oncogenesis, 
we investigated mRNA expression levels of several hu- 
man F2ds <Fzd2, 3, 5, 6, 7, 8 and 9) in more than 30 differ 
ent human tumor samples using real-time quantitative 
PCR. Each sample was compared with its corresponding 
(matched) normal tissue sample. The most striking obser- 
vation was the dramatically increased Fzd5 and Fzd8 
mRNA expression seen in the renal carcinoma samples. 
This was confirmed at the protein level using Western 
blotting. Kidney tumor tissue arrays confirmed Fzd5 
membrane staining in 30% of clear cell carcinomas, with 
nuclear cyclin Dl showing a strong correlation with the 
Fzd5 membrane labeling. Fzd8 protein expression analy- 
sis was not performed due to the lack of suitable reagents. 
These data suggest that Fzd5 may have a role in renal cell 
carcinogenesis due to its frequent overexpression obr 
served inlhese tumor samples. Potential future applica- 
tions could include uses in tumor targeting or as a poten- 
tial biomarker. 



Materials and Methods 

Tissite Samp/es 

Frozen tumor tissue samples with corresponding normal tissue 
from the same patient were derived either from human biopsy or 
autopsy material (Department of Pathology, University of Antwerp, 
kindly provided by Prof. E. Van Marck). Tissue specimens were 
snap-frozen in liquid nitrogen and kept at -80*C uniil use. Frozen 
sections of kidney tumor and normal tissue samples were stained 
with hcmaioxylin-eosin to suppon the pathologist's observations and 
to confirm the type of kidney tumor. Paraffin-embedded tissue slides 
or renal carcinoma, king carcinoma, breast and colon carcinoma 
were obtained, after encryption, from (he Department of Pathology 
(Middelheim Hospital. Antwerp. Belgium). The CLI human kidney 
cancer (Su perBioChips Laboratories) tissue array used in .this study 
contained 59 tissue samples consisting of 9 normal kidney tissues. 



30 clear cell renal carcinoma samples and another 20 renal cell tumor 
types tchromophiK chromophobe, papillary type, collecting duct car- 
cinoma and samples with mixed types). 

&V/J Isolation and Reverse Transcription 
Total RNA was extracted from tissue specimens using Ultraspec 
Reagent (Biolecx, USA) according to the manufacturers instruc- 
tions. All total RNA was routinely treated with DNase (DNA-free kit, 
Ambion, USA). 1 ug of total RNA was used to synthesize cDNA 
using o)Jg6-dT primers (Superscript; Invitrogen, Merelbeke, Bel- 
gium). Reverse transcription was performed at 42 'C for 60 min, fol- 
lowed by 70* C for 10 min. 

Real-Time PCR . 

Real-time PCR was performed on either an ABJ Prism 7700 or 
7900 Sequence detection system (perkin-Elrner Applied Biosystems, 
Foster City, Calif., USA) using ihc 5' nuclease assay (Taqman™). 
Primer and probe sequences were designed using Primer Express (PE 
Applied Biosystems) and are shown in table I. Quantitative values 
were obtained from the threshold cycle number (Ct) at which the 
increase in the signal associated with exponential growth of PCR 
products is delected using PE Biosystems analysis software, accord- 
ing to the manufacturer's instructions. . v 

We have used the 2" A *° method to analyze the relative changes 
in gene expression of the diiTerent genes between tumor and corre- 
sponding normal tissue samples. We used the mitochondrial ATP 
synthase 6 (ATPsy6) as the endogenous RNA con Ixol 1 1 0; Janssens et 
ah, in prep.}, and each sample was normalized to its ATPsy6 content. 
The relative expression of the target gene was also normalized to the 
corresponding normal tissue sample (calibrator). Results, expressed 
as the amourit of target sample relative to the ATPsy6 gene and the 
calibrator, were determined as" follows, N - 2-' 6 ° Mm P k - aomfcnioo 
where the hC\ values of the sample and calibrator were determined 
by subtracting the average Ct value of the sample and the calibrator 
from the average Ct value of ihc ATPsy6 gene. Amplification was 
done essentially as described previously [1 0J. Briefly, 50 pi of reac- 
tion mixture containing I ul of cE>NA template were amplified as 
follows: incubation at 50*C for 2 min, denaturalion at 95°C for 
1 0 min, and 50 cycles at 95 • C for 1 5 s and 60*C for I min. 

Membrane Preparation. Gel Electrophoresis and Immvnobhtttng 
Tissue samples were weighed, suspended at a 40 times dilution 
i= AO volumes/original wet weight of tissue (v/w)J in 50 mM Tns* 
HCI buffer, pH 7.4, and homogenized with an Ultra-Turrax homoge- 
nizes After cent rifugaiion for lO.min,24,0O0^at 4*C, the pellet was 
washed three times by resuspension in the Tris-HCl buffer followed 
by centrifugal ion. The final membrane pellets were stored at - 80* C 
in the Tris-HCl buffer at a concentration of 0.5-1 rog/mL The Brad- 
ford protein assay {Pierce, Aalst, Belgium) was used for protein deter- 
mination. Proteins (50 ug) were separated by 89b SDS-PAGE and 
transferred to nitrocellulose membranes. After primary and second- 
ary antibody incubation, the aniigen-aritibody-peroxidase complex 
was detected by cherriiJuminescence (Pierce, Aalst, Belgium) accord- 
ing to the manufacturer's instructions. 

Immunohistochemistry 

I mm unohist oc he mistry was performed on tO-u^VAihick cryoscc- 
tions of unfixed tumor (issue and on 6-n;V/-thick paraffin "sections 
from renal tumor tissue fixed by formalin or by an alcohol-based fixa- 
tive. Adjacent tissue blocks from renal tumors were processed with 
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Table 1. Real-time PCR primer ond probe sequences 



Target cDNA Primer/probe sequences' 


Fragment 
position 11 


Accession 
No.' 


FZD2 


V**/ ***^^*-i»*|JlL.V|54Jv-J 

(b) 5'-giamgatcatglagaccgtgaagic-3' 

(c) 5'-FAM-tacacgccgcgcatgtcgc-TAMRA-3' 


1,545-1,613 


AB017364 


FZD3 


fa) 5*-tcac°ccattecatpp«*-V • 

(b) 5'-itgtcaccitcaattttaltcatcg-3' 

(c) 5'-FAM-catccccgga3ciciaaccaicatcciitt-TAMRA-3' 


1,473- J 547 


. AB039723 


F2D5 


(a) S'-teccaaeeiractierMit- V 

(b) 5'-tctccaagicgecgcg-3' 

(c) 5^FAM-ccttc3lggt£ctg»gecccc-TAMRA-3' 


2,143-2,204 


HSU43318 


FZD6 


(a) 5'-ctagcacccccaggliaagagaa-3' 

(b) 5'-cccagagagtclggagatggat-3' 

fc) 5'-FAM-igiggtgaacclgcctcgccag-TAMRA-3' 


2,094-2,J70 


AF072873 - 


rJLUt 


(a) 5'-cctg!ggaaaggcataactgtg-3' 

(b) 5'-aaccaacgggaaacctcaga-3' 

(c) S'-FAM-aagcaacm tataggraaagcagcgcaa-TAMRA-3' 


2,687-2,762 


ABO J 7365 


FZD8 


(a) 5Mgiggtcgg»gcicigctl*3' 

(b) 5'-cgciccaigicgaiaaggaag"3' 

(c) 5'-FAM-ccaccttcgccaccgtc£cca^TAMRA-3' 


853-919 


" AB043703 


TZD9 


(a) 5'-ccccgggagciacggac-3' : 

(b) 5'-lagica!gtgcaagaccacgg-3' 

(c) 5'-FAM-tggcacgcactgccactataaggct-TAMRA-3'. - . 


1,696-1,763 


HSU82I69 


ATPsy6 


fill V.DOtot'kMOtftlnApMhlHnt *1* 

iaj j -ggigiaggigrgcciigiggi-j 

(b) 5'-gggcgcagtgaUaiaggcit-3' 

(c) 5-FAM-aagigggclagggcaimiaoiciiagagcg-TAMRA-3' 


. 58f>-503 


AF36827I 


c-myc 


(a) 5'-accaccagcagcgacictga-3' 

(b) 5Mccagcagaaggigaicc3gact-3' 

(c) 3 ; -FAM-Dccimgccaggagccigcctci-TAMRA-3' 


I.297 T MI3 


HSMYC1 


Cyclin D\ 


(a) 5'-gaacctggccgcaaigac-3'- 

(b) 5'-cgcctciggcaittigga-3' 

(c) 5'-FAM-ccgcacgaiIicartgaacacit-TAMRA-3 / 


4,148-4,211. 


AF51 1593 


PPAR5 


(a) 5'-agcatcctcaccggcaaa-3' 

(b) 5'-gtctcgatgicgtgg3tcaca-3' 

(c) 5'- FA M-ccagecacaeggcgccct-TAMR A-3' 


932-990 . 


NM^006238 . 


a (a) -Sense primer, (b) = anliscnse primer, (c) = probe, ' 
*> Fragment positions are given according to the EMBL/GenBank accession No. of doned sequence. 
c EMBUCenBank accession No. of cloned sequence. 



formalin and with the alcohol-based fixative. Paraffin and cryosec- 
rions were mounted on po)y-Z.-Iysine or 3-aminopropyl!rielhoxys>- 
lane-getatin-coated slides. The 59 tissue samples on the CL1 human 
renal cancer tissue array slides were, all fixed with 'formalin, and 
embedded in paraffin, and ihe sections were mounted on siJane- 
coated slides (SuperBioChips Laboratories). In addition to renal car- 
cinoma tissue, sections from 10 formalin-fixed paraffin-embedded 
Jung carcinomas were stained for F2d5. Colon and breast tumors 



were used as positive controls for p-catentn and cyclin DI immuno- 
staining. 

The following primary antibodies were used: F2d5 (Upstate Bio- 
technology), (J-caienin (Zymed), cyclin Dl (Zymed). E-cadherin 
(Novaccstra) and cytokeratin 8 (Biogenex). Cryosections were fixed 
in 4% paraformaldehyde for 5 min; acetone lor 5 m in at -20 °C and 
70% cthanol for 5 min. Endogenous peroxidase activity was 
quenched using 3% HjOs. Paraffin sect tons of formalin- and alcohol- 
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Table 2. Fzd mRNA expression in lumor samples 



Sample 3 


Tissue 


Tumor rype 


.vfold expression increase*' 
















KZD2 


FZD3 


F2D5 


F2D6 


FZD7 


FZD8 . 


F2D9 


133702 


kidney 


* adenocarcinoma • • 


0.17 


0.5 


3J2 


*> 1 1 
z.l J 


V.VO 


1.32 




137770 . 


kidney 


renal cell carcinoma 


1.23 


3.56 


836 


2.61 


1.3 


J? 91 * 

v. AM 


J.H-* 


138844 


kidney 


renal cell carcinoma 


0.31 


0.11 


6.84 


1.18 


0.23 


J.JO 


7 ft 


137146 


kidney 


renal cell carcinoma 


0 47 


ft A$ 


J.JO 


2.23 


0.73 


4.42 


1.37 


137564 


kidney 


. renal cell carcinoma 


■ J.tJ 




Q A 

7.0 


1.57 


7.64 


3.52 . 




.133408 


kidney 


renal cell carcinoma 


7 ? 07 


i\ Off 


O.Jy 


2.48 


5.05 


16.72 


- ■ 


139188- 


kidney 


renal cell carcinoma 


1 1 
J.I 


U.56 


U.00 


1.33- 


0.37 


4.41 


K53 


135699 ■ 


kidney 


renal cell carcinoma 




U.JO 


. A 87 

*f.oJ 


0.9 


0.34 


2.54 


2.31 


139064 . 


kidney 


renal cell carcinoma 


5J6 


1.82 


1.25 


2.36 


1.8 


2.19 


2.85 


J 34585 


kidney 


renal cell carcinoma 


1.47 


0.72 


1.38 


0.47 


0.09 


o;$ 




140279 


kidney 


renal cell carcinoma 


7.33 


18.17 


3.93 


6.05 


6.41 


4.65 


7.91 


137252 


ovary 


carcinosarcoma • 


0.4 


339 


0.49 


0.44 


0.92 


0.54 


0.53 


138256 


ovary 


papillary carcinoma 


0.7 


5.17 


1.22 


2.19 


o;4 


0.M 




146472* 


ovary 


serous papillary carcinoma - 


0.39 


3.29 


2.56 


1.34 


3.94 


.0.67 


0.59 


145845 


colon 


adenocarcinoma 


3.45 


3.36 


0.58 


1,22 


1,67 


124 


1.99 


146145 


colon 


adenocarcinoma . 


5.46 


6.74 


4.42 


6.57 


3.36 . 


6.08 


0.15 


146630 


colon 


adenocarcinoma 


4.01 


4.73 


0.3 


1.16 


0.54 


0.55 


0.12 


146633 


colon 


adenocarcinoma 


1.87 


1.07 


1.62 


1.47 


0.51 


2.07 




147055 


colon 


adenocarcinoma 


0.66 


1.41 


1.01 


1.33 


0.24 


0.79 . 


4.87 



142253 Jung adenocarcinoma 0.67 5.17 0.43 ' 1.87 

143036 lung adenocarcinoma 0.67 1.24 2.63 1.13 

.138938 lung adenocarcinoma 0.93 0.88 1.07 i.3 

133563 lung adenocarcinoma 2.76 1.23 0.31 1 34 

144387 Jung adenocarcinoma 12.85 0.43 0.49 0.25 

137304 lung acinary adenocarcinoma 0.54 - 9,15 1.65 1.25 

144546 lung epithelial carcinoma 0.1 0.67 0.2) 8.1 J 

137621 lung epithelial carcinoma \32 2.19 0.37 2.26 

145532 lung epithelial carcinoma 1.09 - 0.16 0.91 



0.33 
1.11 
1.73 
0.99 
2.73 
0.63 
0.27 
0.52 
1.47 



0.59 
1.04 
0.38 
0.78 
0.54 
3.41 
0.97 
0.45 
0.13 



0.41 

9.19 
0.76 
4.97 

2.64 
1.32 
1.06 
03 



2.43 
32.82 
2.28 
0.88 



0.33 
16.7 
20.82 

028 



143987 testis embryonal carcinoma 0.24 0.6 19.43 1.33 23.12 

137332 siomach leiomyoma jpj? X91 0.45 1.53 18.98 

139026 stroma gastrointestinal carcinoma 66.1 3.51 0.03 6.67 8.44 

136049 rectum adenocarcinoma . 1.71 0.52 0.4 0.54 0.29 

140794 gall bladder adenosquamous carcinoma 0.19 - 0.93 3.02 0.2 

* Sample identification numbers were given by the pathologist. 

b Results are expressed as x-lbld increase of the gene in the tumor tissue sample compared to its matched normal tissue sample after 
normalizing both samples on the basis of their ATPsy6 content. A ojioJT of Mold was used to define differential expression. Significant 
i> 3-fold) increases in the expression level of the Fzd receptor are shown in italics. - = Expression of the target gene undetectable in one or both 
. samples (tumor and/or normal). 



fixed (issue were processed with a irypsinHritratermicrowave pre- 
treatment or with an EDTA-microwave pretreatment to unmask epi- 
topes, respectively. Sections were then sequentially processed with 
primary antibodies, biptinybted secondary antibodies and streplav- 
idin-bioiin-peroxidase (F2d5. E-cadherin and cytokeratin 8). For |J- 
catenin. polyclonal rabbit antibody with the En Vision detection sys- 
tem (DAKO) was used. The slides were further developed using 3- 
amino-9-eihylcarba20le. couniersiained with hemalaun and mount- 



ed with glycerin gelatin. Stained sections were observed with an 
Axioplan 2 microscope equipped with an Axiocam digital camera. 
Staining intensity for p-catenin was scored as no staining (value 0), 
weak and fragmentary staining of cell membranes (value 1). moder- 
ate membrane siaining of less than 50% of the tumor cells (valued), 
moderate membrane staining of more than 50% of tumor cells (value 
3) and strong membrane staining of more than 75% of tumor cells 
(value 4). The cyclin Dl staining was quantified as a percentage of 
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F>g. 1. Fzd5 protein expression in matched 
tumor/normal kidney samples. T - Tumor 
sample; N - matched normal sample. Sam- 
ple identification numbers arc given by the 
pathologist. 



133702 137770 138844 137146 137564 133403 

T N T N T.NT-N T H T N 



139188 135699 ■ 139064 134585 140279 

T T N T N T N T N 



F2d5 



- Actin , 



Fx65 



cyclin DJ-immunoreaciive nuclei in tumor cells in three fields (area: 
1 8,641 um J )ofeach tumor sample; The total number of tumor nuclei 
ranged from 51 10 164. The correlation between Fzd5 and 0-catenm 
staining, and between Fzd5 and cyclin Dl staining was evaluated by 
the Mann-Whitney U test. . 



sion level was observed to be relatively Jo w in these Jung 
tissues compared to the other tissues investigated. 



Resuhs 

FzdmRNA Expression in Matched Human 

Tumor/Normal Tissue Samples 

Fzd expression in tumor tissue was compared with Fzd 
expression in matched norma) tissue samples and normal- 
ized to the expression of the housekeeping gene mitochon- 
drial ATPsy6 (table 2). A 3-fold increase was considered 
significant. . 

In the kidney tumor samples, in which 10 of 1 1 sam- 
ples were clear cell carcinomas, Fzd5 was upregulated in 8 
of the J J samples. A similar observation was made for 
Fzd8 and Fzd2, which were upreguJated in 7 and 5 renaJ 
tumor samples, respectively. None of the other Fzds 
showed consistent upreguJation. 

Both Fzd2 and Fzd3 were upreguJated in 3 of 5 colon 
adenocarcinoma samples. No other Fzd expression was 
significantly different compared to the normal colon tis- 
sue sample. Fzd3 showed an increased expression in all 3 
ovarian carcinoma samples. Fzd expression was not al- 
tered in any of the Jung tumor samples. The Fzd expres- 



Western Blot and Jmmunohistochemistry Analysis on 
Renal Carcinomas 

Western blotting was used to evaluate Fzd5 protein 
expression in the renal tissue samples used for mRNA 
expression analysis. Membrane fractions of the renal car- 
cinoma and corresponding normal tissue samples were 
prepared. As previously shown (table 2), Fzd5 ;mRNA 
upregulation was detected in 8 of the 1 1 matched tumor/ 
normal samples. Increased expression of Fzd5 protein 
was seen in membrane fractions from 9 of I I samples 
(fig. J). In most cases, concomitant increases in Fzd5 
mRNA and protein levels were observed. 

Hematoxylin-eosin staining of the cryosecl ion ed tu- 
mors confirmed the presence of clear cell carcinoma. 
Fzd5 immunostaining in clear cell carcinoma (fig. 2a, b) 
was observed to be localized to cell membranes and to 
nuclei. Cytokeratin 8 (fig. 2c) and E-cadherin (fig. 2d) 
were also detected. E-cadherin labeling of cell membranes 
in clear cell carcinoma was Jess intense and patchy com- 
pared to epithelial cells of normal renal tissue. 0-Catenin 
slaining was confined to the cell membrane. 0-Catenin 
levels in the clear cell carcinoma membranes were highly 
variable. Nuclear P-catenin staining was not observed in 
any of the samples. Epithelial cells in normal renal, tissue 
showed intense membrane staining and some cytoplasmic 
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staining. In addition, weak p-catenin staining of endothe- 
lial cells was observed. A high number of cyclin Dl- 
immunoreactive nuclei was observed in clearcell carcino- 
. ma (fig. '21), 

On the CL1 human kidney cancer tissue array, 30% 
(n = 9) of the clear cell carcinoma tumor samples (n = 30) 
showed Fzd5 immunoreactivity (fig. 3a). Membrane-as- 
sociated J3-catenin staining was observed in 33% of the 
Fzd5-positive tumor samples and 57% of Fzd5-negative 
clear renal cell carcinoma samples (table 3; fig. 3c, d). 
Again, nuclear P-catenin staining was never observed. 
Statistical analysis did not reveal a difference in the 
expression of P-catenin between FzdSrpositive and Fzd5- 
negative tumor samples (fig. 4a). 

Nuclear cyclin D J was observed in. 89% of the Fzd5- 
positive clear cell carcinoma samples (table 3; fig.3e). 
Only 38% of the Fzd5-negative clear cell carcinoma sam- 
ples contained nuclear cyclin Dl. Statistical analysis 
showed a significantly higher cyclin Dl expression in 
Fzd5-positive compared to Fzd5-negative tumor samples 
(fig. 4b). 



Expression of PPAR5 was investigated because it repre- 
sents a direct target of the P-catenin pathway with T cell 
factor binding sites in its promoter. Expression of c-myc 
was found to be upregulated in 7 or 1 1, whilst cyclin Dl 
was upregulated in 10 of 1 1 kidney tumor samples (ta- 
ble 4). PPAR5 was upregulated in 9 cases. All three select- 
ed target genes showed a marked upregulation in . the 
majority of renal tumors, which suggested that the Wnt/ 
P-catenin pathway was activated in these samples. 

Discussion 11 



c-myd Cyclin Dl and Peroxisome 
Proiiferator-Actiyated Receptor 8 Expression in Renal 
Carcinomas 

Wnt/p-catenin.pathway activation in the kidney tissue 
samples was investigated looking at the expression of a 
number of target genes, which have previously been 
shown to be upregulated when the pathway is active. 
Gene expression of c-myc, cyclin Dl and peroxisome 
prbliferator-activated receptor 5 (PPAR5) was analyzed. 
Increased expression of both c-myc and cyclin Dl genes 
have been impJicaied in cell proliferation, and carcino- 
genesis, and they represent two ofthe more important and 
closely studied target genes of the Wnt signaling pathway. 



Hg.2. Distribution of FzdS fa, b), cytokcratin 8 (c), E-cadberin (d) 
andp-catcnin (e) immunoreactivity , n paraffin sections from a renal 
luroor processed by an alcohol fixative. From the same tumor, a for- 
malin-fixed block was used Tor cycJin Dl immunosiaming (I). Fzd5 
immunosiainingshows distinci immunoreactivity in cell membranes 
and in nuclei of clear cell renal carcinoma. Clear cells are immunore- 
active for cytokeratin 8. 0-Caienin and E-cadherin staining of mem- 
branes is rather weak, and.not uniform, in dear ceJI renal carcinoma. 
Nuclear p-catenin immunoreactivity was not observed. In clear cell 
renal carcinoma, many nuclei showed cycJin Dl immunoreactivity. 
The inset in I shows a detailed view ofthe cyclin Dl labeling of nuclei 
* in clear celj renal carcinoma. 



Fzd family member overexpression has been postu- 
lated to play key role.s in different tumor types such as 
esophageal carcinoma fl J], gastric cancer [J 2] and head 
and neck squamous cell carcinoma [13], The current 
study evaluated the potential implication of Fzds as 
tumor-associated antigens in different tumor types. We 
screened a number of matched normal/tumor tissue sam- 
ples for the expression of a variety of Fzds using real-time 
quantitative PCR. 

Results obtained revealed that both Fzd5 and Fzd8 
mRNA were overexpressed in the majority of renal carci- 
noma samples when compared to the matched normal 
kidney samples. Fzd2 and Fzd3 were upregulated in 3 of 5 
colon adenocarcinoma samples. Fzd3 was also upregu- 
lated in the ovarian tumor tissue samples compared to the 
matched normal tissue samples. None of the other Fzds 
evaluated showed a specific differential expression pat- 
tern in any of the samples studied. Fzd5 and Fzd 8 show 
69.1 % similarity and belong to the same subgroup of Fzds 
[14]. The significantly higher expression of Fzd5 and 
Fzd8 in the renal tumor samples, as compared to the nor- 
mal renal sampfes, suggests a higher probabiJity that this . 
subgroup rnay be implicated in the progression of renal 
cancer. Therefore, we decided to further examine the pos- * 
sible role of Fzd5 in renal carcinoma. 

We observed, using Western blotting, that protein lev- 
els were mostly consistent with mRNA levels in the tumor 
samples. In order to be able to determine the Fzd5 expres- 
sion in a broader range of kidney tissues, we utilized a 
tissue array. F?d5 membrane staining was detected in 9 of 
30 (30%) clear cell carcinomas, and importantly, mem- 
brane staining was not detected in the matched 9 normal 
kidney tissue samples. 

Since the Wnt signaling pathway appears to play an 
important role in embryonic development, in particular 
embryonic kidney induction [15, 16], activation of this 
pathway in the adult kidney due to mutation or Overex- 
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Table 3. Correlation between Fzd5 and 
0-caienin orcyclin DJ expression 

F 2 d5.% 



ft-Catenin.+ 
^Caiehin- 
Cydin Dl + 
CycItn'Dl - 



33 
67 
89 
II 



57 
43 
38 
62 



pressxm or one of t he components of the pathway could 
be a determining factor in the development of rena] can- 
cers .Therefore, several studies ha ve JooJced into the possj. 
bJe function the Wnt/p-catenin pathway plays in raa] 
carcinogenesis. APC gene mutations have been demon, 
strated not to be involved in renal carcinoma [I7,]8J Jn 
addHion, p-catehin mutations are rare events in renal car- 
cinoma [19, 20]. Nevertheless, cytoplasmic accumulation 
of p-catenm has been reported in a number of renal cell 
carcinomas [19], and thus the Wnt signaling pathway 



Table 4. Wnt/0<atenin target gene mRNA 
Sample 3 Tissue 



expression in tumor samples 
Tumor type x-foWexp^rion increase* 



133702 

137770 

138844 

137146 

137564 

J 33408 

139188 

135699 

139064 

134585 

140279 



Jcidney adenocarcinoma 
kidney renal cell carcinoma 
Sidney renal cell carcinoma 
kidney renal cell carcinoma 

. Sidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 



c-myc 


eyclinJDl 


PPAR5 


0.54 


4.52 


0,52 


13.9 . 


28.91 


5.53 


2.39 


31.49 


7.48 


7.62 


15.38 


3.15 i 


33.82 


19.65 


8.65 


7.8 


8.92 


4.86 


2.22 


9.92 


11.67 


12.18 


22.73 


5J3 


22.it 


5.04 


6.33 


1.79 


1.14 


1.67 


61.68 


54.95 


14.62 
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Fid5+ 



FxdS- 



SajnrJe .dcm.ficat.on numbers were given by the pathologist. 

Results are expressed as x-foid increase of ihe^nrm .h,. • 
10 its matched normal .issue sample after no™,." * h t' Umor ««« compared 
ATTsy6 content. A cutoff of tS^ 'rtTZT* 5? ' 3mp,eS ° n "* basis of th ™ 
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F*dS* 



F»9 v 3.T 2 d5, p-carenin and cyctin Dl immunostaining of the CI i - 
arrays. The left column of imam nxyr^u J^, ng .°' ,h « CLI "™l carcinoma nssue 
show an overview of eac ^StJS^^T^ Tnm "w™!* U *™s 
tiveclearccllrenalcarcinomaT oSu^ 

Tht right column of irnages is taker from *r al^t I r ' rCna ' Ca ™°™- 

carcinoma from mis tumor sample d£^ 

duiinct membrane P-c aI enin staining (d) ' b}3nd CyC,,n Dl < f > buI doesshow 



Fig. 4. Box plot charts (thick black line = 
median) illustrating the relationship be- 
tween F*d5 immunosiaining and P-caienin 
(a) and cyclin Dl (b) expression m clear cell 
renal carcinoma. No significant correlation 
was observed between the 0-catenin scores of 
Fzd5-positive and -negative clear cej) renal 
carcmoma. Nuclear cyclin. Dl 5Ia i„ ing in 
clear cell renal carcinoma showed a signifi- 
cant difference between F2d5-posmvc and 
Fzd5-negative tumor samples. 
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might act as an inducer of tumorigenesis in the kidney, nin was not observed in any of the tumors or on the tissue 
This view is supported by the observation that aberrant array. This result is consistent with the data presented for 
activity ofiheWnt signaling pathway has been reported in renal cell carcinomas by Kim et al. [i9J. They did not 
renal-cancer-derived eel) lines. Zanget al. [21] observed a detect nuclearp-catenin staining in the 52 renal ceD carci- 
higber expression level of Wnt 5a andFzdS mRNA in the nomas examined. The lack of nuclear 0-catenin staining 
renal cancer cell line GRC-1 than in the normal renal cell has also been reported by others in tumors that might 
line HK-2. Expression of ^cateniri was also higher in have arisen from Wnt/p-catenin pathway activation [28- 
GRC-1 than in HK-2. 31]. 

To determine the status of the canonical Wnt signaling While expression of both Wnt5a and Fzd5 does induce 
pathway in our renal carcinoma samples, we have quanti- duplication or the Xenopus head, exogenous expression of 
tated the mRNA levels of three important target genes of Fz65 in a Xenopus model does not induce duplication of 
T cell factor/lymphoid enhancer factor activation by the head [32]/Fzd5 does not activate the p-catenin signal- 
0-catenin. The mRNA levels of these three target genes ing pathway on its own, as the presence of its endogenous 
(omyc, cyclin Dl and PPAR 5) correlated largely with the ligand is also required. Our results.suggest that F2d5 may 
expression, of F2d5 in these samples, suggesting that the have a role in renal cell carcinogenesis due to its frequent 
canonical pathway is activated. On the kidney tissue overexpression observed in these tumor samples, and we 
array, cyclin Dl prot em expression showed a highly sig- hypothesize that if Fzd5 is overexpressed, it has a rather 
nificant correlation with the Fzd5 expression in the tumor limited effect, on p-catenin signaling. However, in the 
samples (table 3). Cyclin Dl protein is frequently. overex- presence of its endogenous still unknown ligand,' it acti- 
pressed in various tumors, but in only a. proportion of the vates the canonical Wnt signaling pathway. The elucida- 
cases is it due to amplification of the cyclin Dl gene [22]. tion of this ligand and its binding characteristics is still 
Therefore, other mechanisms such as upregulatiori of gene under in vestigation. Ultimately, knowledge of the specific 
transcription may play a substantial role in the overex- expression patterns of both Wnt and Fzd members could 
pression of cyclin Dl [23-26]. Our data, showing in- lead to directed tumor targeting or could be used as a 
creased cyclin Dl expression in renal carcinoma samples, tumor marker, 
are consistent with the results ofStassar et al. [27], They 
studied genes that are associated with human renal carci- 
noma by suppression subtractive hybridization and re- Acknowledgment 
ported 14 differentially expressed genes, including cyclin 

Dl, Although we would have expected an increased ntt- w e are grateful to Pro/X Van Marck (University of Antwerp) for 
dear ffceatenin staining, nuclear accumulation of p-cate- providinfiUS wilh ,hc «i"ue samples. 
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Immunohistoclicmtcal analysis of NY-ESO-1 antigen expression in normal 
and malignant human tissues. 

Jungbluth A A , Chen YT , Stockert E . Busam KJ , Kolb D, Ivei sen K , Coplan K , 
Williamson B , Altorki N , Old LJ . 

Ludwig Institute for Cancer Research, Memorial Sloan-Kettering Cancer Center, New 
York, NY, USA.jungblua@mskcc.org 

NY-ESO-1, a member of the CT (cancer/testis) family of antigens, is expressed in normal 
testis and in a range of human tumor types. Knowledge of NY-ESOI expression has 
depended on RT-PCR detection of mRNA and there is a need for detecting NY-ESO-1 at 
the protein level. In the present study , a method for the immunochemical detection of 
NY-ESO-lin paraffin-embedded tissues has been developed and used to define the 
expression pattern of NY-ESO-1 in normal tissues and in a panel of human tumors. No 
normal tissue other than testis showed NY-ESO-1 reactivity, and expression in testis was^ 
restricted to germ cells particularly spermatogonia. In human tumors, the frequency of 
NY-ESO-1 antigen expression corresponds with past analysis of NY-ESO-1 mRNA 
expression e.g., 20-30% of lung cancers, bladder cancers and melanoma, and no 
expression in colon and renal cancer. Co-typing of NY-ESO-1 antigen and mRNA 
expression in a large panel of lung cancers showed a good correlation. There is great 
variability in NY-ESO-1 expression in individual tumors, ranging from an infrequent 
homogeneous pattern of staining to highly heterogeneous antigen expression. Copyright 
2001 Wiley-Liss, Inc. 
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Multidrug resistance puospkoglycoprotein (ABCBl) in the mouse 
placenta: fetal protection. 

Kalabis GM, Kostaki A. Andrews MH Petropoulos S . Gibb W. Matthews SG . 
Department of Physiology, University of Toronto, Ontario, Canada. 

The multidrug resistance phosphoglycoprotein ATP-binding cassette subfamily B 
(ABCBl) actively extrudes a range of structurally and functionally diverse xenobiotics as 
well as glucocorticoids. ABCB 1 is present in many cancer cell types as well as in normal 
tissues. Although it has been localized within the mouse placenta, virtually nothing is 
known about its regulation. In the mouse, two genes, Abcbla and Abcblb, encode 
ABCBl. We hypothesized that there are changes in placental Abcbla and Abcblb gene 
expression and ABCBl protein levels during pregnancy. Using in situ hybridization, we 
demonstrated that Abcblb mRNA is the predominant placental isoform and that there are 
profound gestational changes in the expression of both Abcbla and Abcblb mRNA. 
Placentas from pregnant mice were analyzed between Embryonic Days (E) 9.5 and 19 
(term approximately 19.5d). Abcblb mRNA was detected in invading trophoblast ceils 
by E9.5, peaked within the placental labyrinth at E12.5, and then progressively decreased 
toward term (P < 0.0001). Abcbla mRNA, although lower than that of Abcblb at 
mtdgestation, paralleled changes in Abcb I b mRNA. Changes in Abcb I mRNA were 
reflected by a significant decrease in ABCB 1 protein (P < 0.05). A strong correlation 
existed between placental Abcb 1 b mRNA and maternal progesterone concentrations, 
indicating a potential role of progesterone in regulation of placental Abcblb mRNA.' In 
conclusion, there are dramatic decreases in Abcbla and Abcblb mRNA and in ABCBl at 
the maternal-fetal interface over the second half of gestation, suggesting that the fetus 
may become increasingly susceptible to the influences of xenobiotics and natural steroids 
in the maternal circulation. 
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Expression of human telomerase reverse transcriptase gene and protein 
and of estrogen and progesterone receptors, in breast tumors: Preliminary 
data from neo-adjuvant chemotherapy. 

ggffgSffi fe""uyama N, Hashimoto M, NakanmraK, OkanoT, Kurabavashi R 
jVaojaH, HomnaN, Ogawa T, Kanunislii M . Takubo K . 

Division ofBreast and Endocrine Surgery, Department of Surgery, Graduate School of 
Medicme, The University of Tokyo, Bunkyo-ku, Tokyo 1 1 3-8655, Japan, kanmori- 
dts@umin.ac.jp. 

Human telomerase reverse transcriptase (hTERT), the catalytic subunit of telomerase is 
very closely associated with telomerase activity. Telomerase has been implicated in ' 
cellular, immortalization and carcinogenesis, in situ detection of hTERT will aid in 
determining the localization of telomerase-positive cells. The aim of this study was to 
detect expression of hTERT mRNA, hTERT protein, estrogen receptor (ER) and 
progesterone receptor (PR) in paraffin-embedded breast tissue samples and o investigate 
the relat,onsh ip between hTERT expression and various clinicopathological parameters 
in breast tumongenesis. We used in situ hybridization (ISH) to examine hTERT K ene 
expression, and immunohistochemistry (IHQ to examine expression of hTERT protein 
ER and PR, m breast tissues including 64 adenocarcinomas, 2 phyllode tumors and their 
adjacent normal breast tissues. hTERT gene expression was detected by ISH in 56 (88%) 

dSc^b^ 

detected by IHC m 52 (81%) carcinomas, but in neither of the 2 phyllode tumors 
Moreover ER and PR were expressed in 42 (66%) and 42 (66%) carcinomas 
respectively, and in neither of the 2 phyllode tumors. In 4 cases of breast carcinoma that 
singly expressed hTERT gene and protein before treatment, neo-adjuvant 
chemotherapy led to disappearance of gene and protein expression in all cases. There was 
a strong .correlation between detection of hTERT gene expression by ISH and of hTERT 
protein by ICH in tissue specimens from breast tumors. These results suggest that 
detection of hTERT protein by ICH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 
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Abstract. Human telomerase reverse transcriptase (hTERT), 
the catalytic subunit of telomerase. is very closely associated 
with telomerase activity. Telomerase has been implicated in 
cellular immortalization and carcinogenesis. In m detection of 
hTERT will aid in determining the localization of telomerase- 
positive cells. The aim of this study was to delect expression of 
hTERT mRNA, bTERT piotein, estrogen receptor (ER) and 
progesterone recepior (PR) in paraffin-embedded breast nssue 
samples and to investigate the relationship between hTEKl 
expression and various clinicopaihological parameters in breast 
turoorigenesis. We used in siw hybridization (1SH) to examine 
hTERT °ene expression, and immunohisiocbernistry (1HQ to 
examine°expression of hTERT protein, ER and PR. in breast 
tissues including 64 adenocarcinomas. 2 phyllode tumors and 
.heir adjacent normal breast tissues. hTERT gene expression 
was detected by 1SH in 56 (88%) carcinomas, but in neither 
or the 2 phyllode tumors. hTERT protein expression was 
detected by 1HC in 52 (81%) carcinomas, but m neither of 
the 2 phyllode rumors. Moreover. ER and PR were expressed 
in 42 (66%) and 42 (66%) carcinomas, respectively, and in 
neither of the 2 phyllode tumors. In 4 cases of breast carcinoma 
that strongly expressed hTERT gene and protein before 
treatment, neo-adjuvant chemotherapy led to disappearance of 
gene and protein expression in all cases. There was a strong 
correlation between detection of hTERT gene expression by 
JSH and of hTERT protein by 1CH in tissue specimens from 
breast .umors. These results suggest thai detection of hTERT 
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protein by 1CH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 

Introduction 

Breast cancer is the most frequent malignancy in women, 
affecting up to one in every eight females worldwide. The 
most important clinicopaihological prognostic parameter so 
far identified is the absence or presence of lymph node 
metastasis, but the identification of further parameters tor 
both lymph node-posi.ive and -negative patients would 
facilitate an individually based risk-directed therapy (1). A 
promising emerging molecular marker is •e'omeras-. a 
ribonucleopro.ein enzyme complex, which when activated 
or upregulated allows tumor cells to escape from cellular 
senescence and .0 proliferate indefinitely (2). The human 
telomere is a simple repeat sequence of six bases (TTA GGG) 
that is located at the ends of each chromosome (3). Telomeres 
are believed to protect against degeneration, reconstruction, 
fusion, and loss (4) and .0 promote the homologous pairing 
or chromosomes (5). The end-to-end chromosome fusions 
observed in some tumors may result from the loss of itlorneres 
and may be partly responsible for the genetic instability 
associated wi.h lumorigenesis. Telomerase catalyzes the 
synthesis of telomere DNA and facilitates cell immorlal.zaOon 
through the stabilization of chromosomal structure (6-8). 
Although the expression of the human RNA component of 
telomerase (hTERC) is widespread, the restricted expression 
pattern of the mRNA of hTERT, the human telomerase 
catalytic sobunit tene, is correlated with telomerase activity 
(8-13) As telomerase activity seems to be the key player in 
,umo. cell immortality, it has importance as a target molecule 
for ami-cancer therapy. Telbmerase activity has been shown 
to correlate wilh poor clinical outcome in neuroblastomas 
and o.her tumors (14). For breast cancer, however, telomerase 
aciivity is a controversial prognostic marker: some studies 
sugocsl lhat telomerase activity, clinicopaihological parameters 
3 „d disease outcome are linked, whereas others do not lind 
this association (14-23). 
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We have succeeded in very clearly and sensitively demon- 
strating hTERT mRNA in thyroid, colorectal, parathyroid 
and lung tissues by use of an oligonucleotide probe (13,24-26). 
Strong correlation has been observed between hTERT mRNA 
and/or protein expression and telomerase in a variety of 
malignant tumors (13.14,24,25.27,28). In the present study, 
we used ISH to examine expression of the hTERT gene, and 
IHC to examine expression of hTERT protein, ER and PR, in 
64 carcinomas and 2 phyllodes rumors of breast to determine 
whether hTERT protein can be used to differentiate breast 
cancers. We also analyzed hTERT mRNA and protein 
expression with special reference to clinical features and 
histological findings to investigate the potential role of 
hTERT mRNA expression analysis in predicting the biological 
characteristics of breast cancers. Since hTERT expression in 
breast rumors has not previously been analysed by ISH or IHC. 
our investigation also examined various clinicopatbological 
parameters, including age, histopathplogical.type, tumor size, 
lymph node status, relapses, and the expression of ER and PR. 

* Materials and methods 

Tissue collection. Sixty-six samples were obtained during 66 
mastectomies: 64 breast carcinomas, 2 phyllodes tumors and 
66 specimens of the adjacent normal breast gland. In 4 eases, 
samples were obtained during core needle biopsies (CNB) 
before neo-adjuvant therapy and again during mastectomies 
after neo-adjuvant therapy; for these cases, all measurements 
and examinations were performed both before and after the 
neo-adjuvant therapy. Hie patients ranged in age from 32 lb 
90 years. The patients with carcinomas ranged in age from 37 
to 90, mean 56, years and were all women. The women with 
phyllode tumors were aged 32 and 38 years, respectively. 
The surgical and CNB tissue samples were frozen rapidly 
wiih liquid nitrogen and stored at -80*C until fixation. Then, 
they were fixed in \0% buffer formalin solution and 
embedded in paraffin. Surgical and CNB samples were 
collected from the patients after obtaining their informed 
consent, and the study protocol was approved by the Medical 
Department of the University of Tokyo Ethics Committee. 
The pathologic diagnoses weie made by the surgical 
pathological specialists at our institute on the basis of 
examination of hematoxylin-eosin stained slides. A 
pathological review was performed for all breast tumors 
according to the BRE score. pT and pN staging were assigned 
according to the 1997 WHO classification (7th edition). 

MCF-7 human breast cancer cells, kindly provided by the 
Cell Resource Center for Biomedical Research. Institute of 
Development. Aging and Cancer, Tohoku University, were 
used as positive controls. The cells were incubated in RPJvtl- 
1640 medium with 25 mM HEPES buffer, L-glutamine. and 
10% fetal bovine serum (Gibco, Grand Island, NY) on a 
chamber-attached slideglass (Lab-Tek* Chamber Slide™: 
Nalge Nunc International „ Naperville, IL) in a humidified 5% 
CO, atmosphere at 37"C, The cells were then fixed with 10% 
buffered neutral formalin (Sigma Chemical Co., St. Louis, 
MO). The culiured MCF-7 cell line that was used as a positive 
control was tested for telomerase with a PCR- based standard 
TRAP assay (6.! 3). These cells were also used to pieparc cell 
blocks. Briefly, the cells were fixed in 10% buffered neutral 



. formalin, resuspended in molten agarose and then embedded in 
paraffin. Sections from these cell blocks were used as positive 
procedural controls in ISH and IHC. The negative control in 
ISH was obtained by replacing the oligonucleotide probe with 
RNase. The negative control in IHC was obtained by replacing 
the primary antibody with Tris-buffered saline (IBS). 

Oligonucleotide probe for ISH. The specificity of the oligo- 
nucleotide sequence was initially determined by a GenEMBL 
database search using the Genetics Computer Group Sequence 
Analysis Program (GCG, Madison, WI) based on the fastA 
algorithm (29); the sequence exhibited 100% homology 
with the hTERT gene sequence. A dfT)^ oligonucleotide 
was used to verify the integrity and lack of degradation of 
the mRNA in each sample. All oligonucleotide probes 
were synthesized with a hapten-labeled nucleotide, such as 
digoxigenin-dUTP (Boehringer-Mannheim), at the 3* end via 
direct coupling by using standard phosphoramidite chemistry 
(Research Genetics, Humsville, AL) (30). The probe used for 
detection of hTERT by ISH was generated from the original 
sequence for Homo sapiens telomerase reverse transcriptase 
(AF0 15950), 2766-2800: 5*-GCCTCGTCTTCTACAGGGA 
AGTTCACCACTGTCTT-3' (13,24-26). 

ISH. ISH was performed with the GenPoint nucleic acid hypcr- 
detection system (Dako, Carpenteria, CA) (31). FormaJin- 
fixed, paraffin-embedded tissue sections (5 pm thick) were 
deparaffiniied in xylene and a graded alcohol series. Tissue 

. sections and CNB samples were then pretreated with target 
retrieval solution (Dako, SI 700) at 95*C and proteinase K 
(Dako, S3004) at room temperature. NexU the tissues and 
CNB samples were fixed in 03% hydrogen peroxide followed 
by a methyl alcohol series at room temperature. Digoxigenin- 
labcled anti-sense oligonucleotide in mRNA in situ hybri- 
dization solution (Dako. S3304) was placed over the tissues 
and CNB samples. After hybridization at 37*C overnight, 
the slides were washed in stringent wash solution (Dako, 
GenPoint System Kit) at 45X. The tissues and CNB samples 
were exposed io avidin blocking solution (Dako, X059O) 
at room temperature followed by biotin blocking solution 
(Dako, X0590) at room temperature. The tissues and samples 
were then incubated at room temperature with a sheep 
monoclonal hapten-labeled anti-digoxigenin antibody (Dako. 
p5104), and the slides were then fixed with biotinyl tyramine 
(Dako, GenPoint System Kit) at room temperature. Finally, 
the slides were incubated with HRP-conjugated streptavidin 
(Dako. GenPoint System Kit) at room temperature. Since 
33*-diaminobenzidine ictrahydrochloride (DAB) was used as 
ihe substrate, a positive reaction was visible as a brown color 
under a light microscope. The sections were weakJy counter- 

siained with 0.1% hematoxylin. 

JHC. IHC was performed by the avidin-biotin complex/ 
horseradish peroxidase method. Tissue sections were stained 
for hTERT with a commercially available monoclonal antibody 
(NCL-L-hTERT: Novocastra, Newcastle upon Tyne. UK). 
Seciions were dewaxed in xylene. Antigen retrieval was done 
by incubating sections immersed in 0.01 M citrate buffer at 
pH 6.0 in a microwave oven at 99"C. The sections were 
allowed io cool down at room temperature. The sections 
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■ Figure I . Representative resuhs of the TRAP assay. If functional telomeres*, is 
present, the enzyme adds DNA to the substrate in 6-base-pair {bp) increments, 
resulting in a ladder- like distribution of products. The 6- bp ladder signals are 
apparent for MCF-7 and breast cancers (case nos. 6. 14, 23.44 and 59) and are 
not apparent for lysis buffer as the negative control. An extract of MCF-7 was 
used as a positive control for the TRAP assay and as an Interna) Telorncnoe 
Assay Standard (ITAS) positive control for PCR amplification, with lysis 
buffer as the negative control (Ly*b). 

were then immersed in 1% hydrogen peroxide (H 2 0 2 ) in 
methanol lo block endogenous peroxidase activity. Following 
that, the sections were washed in TBS (pH 7.6) before being 
incubated in normal rabbit serum for 20 min to block non- 
specific binding. After draining off the excess serum, the 
sections were incubated with the primary antibody at room 
temperature. The sections were washed in TBS before being 
incubated with the secondary antibody (biotinylaied rabbit 
anti-mouse, Dako). The sections were washed again with 
TBS and incubated with avidin biotin complex/horseradish 
peroxidase. After washing the sections with TBS, peroxidase 
activity was visualized under light microscopy by applying 
DAB chromogen (Dako). The sections were couniersrained 
with hematoxylin, dehydrated in increasing grades of alcohol 
and finally mounted in dibutyl phthalate (DPX) mountanl. 

Homogeneous staining or a speckled/dotted pattern in the 
nucleus was considered positive staining; and absence of 
distinct nuclear staining was taken as negative staining. 
Grading of the percentage of stained cells (hTERT labeling 
index) was performed by previously published. criteria (J) as 
follows: Grade 1, negative staining; Grade 2, 1-10% positive 
staining nuclei; Grade 3; J 1-50% positive nuclei; and Grade 4, 
>50% positive nuclei. Immunostained slides for ER and 
PR were scored as previously described (32,33). Jn brief, 
each entire slide was evaluated by light microscopy. First, a 
proportion score was assigned, which represents the estimated 
pioponion of positive- staining tumor cells (0, none; 1 , < 1/100; 
2. 1/100 to 1/10; 3, 1/10 to 1/3; 4, 1/3 to 2/3; and 5, >2/3). 
Next, ari intensity score was assigned, which represents (he 
average intensity of positive tumor cells (0. none; I, weak, 
2, intermediate; and 3, strong). The proportion and intensity 
scores were then added to obtain a total score, which ranged 
from 0 to 8. Slides were scored by pathologists who did not 
have knowledge of ligand-binding results or patient outcome. 



Table I. Relationships between mRNA status (negative/ 
positive) by ISH and standard clinical, pathological, and 
biological factors in the 66 tumors. 

No. of patients (%) 

Total 

population hTERT hTERT ' 

(%) negative positive P- value* 



Total 66 10(15.2) 56(84.8) 

Age NS 

<50 26 6(23.1) 20(76.9) 

>50 40 . 6(15.0) 34(85.0) 

Hisiopatho- 

logical type ^ NS 

Scirrhous 32 * 2 (6.4) 30(93.6) 

Papillorubular 20 2(10.0) 18(90D) 

Solid tubular 6 1 (16.7) 5(833) 

Mucinous 2 1 (50.0) .1 (50.0) 

Non-invasive 4 2(50.0) 2(50.0) 

Phyllodes 2 2 (100) 0 (0) 

Tumor size : NS 
(cm? 

Tl (<2.0) 20 0 (0) 20 (100) 

T2 (2.0-5.0) 34 . 6(17.6) 28(82.4) 

T3 (>5.0) 10 2 (20.0) 8 (80.0) 

Lymph node 

starus b NS 

pNO 34 6(17.6) 28(82.4) 

pNl 26 2 (7.7) 24(923) 

pN2+pNM 4 0 (0) 4 (100) 

Relapse NS 

+ 10 0 (0) .10 ()00) 
56 10(17.6) 46(82.4) 

ER expression NS 

+ (*2) 42 6(143) 36(85.7) 

- (<2) 24 4 (16.7) 20(833) 

PR expression NS 

-f (5:2) 42 4 (9.5) 38(90.5) 

- (<2) 24 6(25.0) 18(75.0) 



V test. NS, not significant: "Information available for 64 patients. 



Statistical analysis. Differences in p- values were analyzed 
with the x 7 test for independence, and Fisher's test was used 
for correlations. In all comparisons, p<0.05 was considered 
significant. 

Results 

Representative results of the TRAP assay are shown in Fig. 1. 
The cultured cells, which were tested for lelomerase activity 
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Figure 2. Correlation between histologic diagnosis, human lelomerasc revere transcripl&st (hTERT) mRNA by 1SH. hT£RT p/ottin by IHC. estrogen 
receptor (EH) by IHC and pio£«icjonc receptor (PR) by IHC in bicwr cancers, (a; H&E); (b. hTERT mRNA); <e, hTERT protein): {d. ER) and (e. PR). 



with the TRAP assay, cave positive results with all procedural 
controls (MCF-7 and 5 breast cancer samples) (Fig. I). 

JSH revealed that hTERT mRNA was strongly expressed 
in the nuclei and cytoplasm of almost alj of the MCF-7 human 
cancer cells (data not shown). Expression of hTERT mRNA 
was detected in 56 (88%) of the 64 breast cancers and in none 
of the phyllodes tumors of the breast (Table I) wijh the anti- 
sense probe, whereas no expression was detected with the anti- 
sense probe treated with RNase (data not shown). The levels of 
expression were heterogeneous within the carcinomatous 
. regions. Strong expression of hTERT mRNA was not confined 
to the carcinomatous regions but was also delected in 
infiltrating lymphocytes (Fie. 2a and b). Higher expression 
levels of both signals of hTERT mRNA were delected in 
some sections containing both carcinomas and lymphocytes, 
compared with the adjacent non-cancerous mammary gland, 
but no clear differences in signal intensity were observed 
between carcinomas and lymphocytes. The signals in both 
the normal and cancer tissues were mainly present in the 
lymphocytes, and the signal intensity was similar in both, 
although a precise quantitative comparison of the in situ signals 
was impossible. 



IHC revealed thai hTERT protein was strongly expressed 
in the nuclei, nuclear membrane and cytoplasm of almost all 
of the MCF-7 human cancer cells (data not shown). Expression 
of hTERT protein was delected in 52 (81%) of the 64 breast 
cancers and in hone of the phyllode tumors of the breast 
(Table 11). The levels of expression were heterogeneous within 
the carcinomatous regions. As shown in Fig. 2c; strong 
expression of hTERT protein was observed in nuclei, nuclear 
membrane and cytoplasm, similar to the pattern in MCF-7 
human cancer cells. Normal mammary gland and stromal 
cells generally showed negative immunoreaciivily against 
hTERT protein antibody. 

A nuclear signal for the ER (Fig. 2d), as assessed by IHC. 
was observed in 36 (56%) of the 64 breast cancers and in 
none of the phyllode tumors of the breast, with positive 
scores ranging from 2 to 8 (Tables 1 and I)). A nuclear signal 
for the PR (Fig. 2e). as assessed by IHC, was observed in 38 
{59%) of the 64 breast cancers and in none of the phyllode 
tumors of the breast, with positive scores ranging from 2 to 8 
(Tables I and II). 

We used ISH and IHC to examine hTERT expression 
in 4 cases of breast cancer before and after neo-adjuvani 




Table II. Relationships between mRNA status (negative/ 
positive) by IHC and standard clinical, pathological, and 
biological factors in the 66 tumors. 

No. of patients (%) 

Total hTERT hTERT 

negative positive P-value* 



Total 


66 


14(2J.2) 


52 (78,8) 


' Ape 










26 


10(383) 


16 (61.5) 




40 


4(10.0) 


36 (90.0) 


Histopathologic^! 








type 








Scirrhous . 


'X'y 




jU \yJ.O) 


Papillorubular 


20 


4 (20.0) 


16(80.0) 


Solid tubular 


6 


* 4 (667) 


2 (333) 


Mucinous 


2 


0 (0) 


2 (100) 


Non-invasive 


■ 4 . 


. 2(50.0) 


2 (50.0) 


Phyllodes 


2 


2 (100) 


0 (0) 


Tumor si2e (cm)> 








TJ (<2.0> 


20 


4 (20.0) 


16(80.0) 


T2 (2.0 sjay 


34 


6(17.6) 


28 (82.4) 


T3 (>5.0) 


10 


2 (20.0) 


8 (80.0) 


Lymph node status** 








pNO 


34 


6(17.6) 


28 (824) 


P NJ 


26 


6(23.1) 


20 (76.9) 


pN2+pNM 


4 


0 (0) 


4 (100) 


Relapse 










10 


1 (10.0) 


9 (90.0) 




56 


13(23.2) 


43 (79.6) 


ER expression 










42 


6(143) 


36(85.7) 


- (<2) 


24 


8 (333) 


16(66.7) 


PR expression 










42 


6(143) 


36 (85:7) 


- (<2) 


24 


8(333) 


16(66.7) 



NS 



NS 



NS 



NS 



NS 



*X ? lesL NS. noi significant, information avDilabtc for 64 paiiems. 



chemotherapy. Before chemotherapy . all 4 of ihe bieast 
carcinomas strongly expressed hTERT by both ISH and 
IHC. After chemotherapy. hTERT expression completely 
disappeared in all 4 cases (Table III). hTERT expression by 
lymphocytes was detectable by ISH and IHC both before and 
after chemotherapy in all 4 rases, and the level of expression 
did not appear to be altered by treat me nt. 

No correlation was observed between hTERT mRNA 
expression and any of the clinicoparhological parameters 
age. histopatholosicaJ type, tumor vizc, lymph node status. 




Table III. Relationship of hTERT mRNA and protein 
expression before and after neoadjuvant chemotherapy. 

hTERT mRNA hTERT protein 



Case Age Neoadjuvant. Before After Before After 



1 


80 


AnastrozoJe 






2 


78 


Anastrozolc . 


+ 


+ 


3 


35 


FEO 


-f 


+ 


4 


37 


AO 







"FEC5FU (500 mgmr), Epirubicin (70 mg/nr). Cyclophosphamide 
(500 mg/m 2 ). b AC, Doxorubicin (60 mg/nr), Cyclophosphamide 
(500 mg/m 1 ). Before, before neoadjuvant chemotherapy. After, 
after neoadjuvant chemotherapy. + » positive; negative. 



relapses, and the expression of ER and PR. Similarly, there 
was no correlation between hTERT protein expression and 
any of these clinicopathological parameters. There was a 
correlation between hTERT mRNA expression and hTERT 
protein expression in breast cancers (p<0.005). 

Discussion 

This study reports a comparison of hTERT mRNA expression 
by ISH and hTERT protein expression by IHC in tissue 
sections from breast tumors. hTERT mRNA was detected 
by ISH in 56 of the 64 breast cancers and in MCF-7 human 
breast cancer cells. Breast cancer cell nuclei stained strongly 
positive with the specific anti-sense probe but not with the 
anti-sense probe treated with RNase (data not shown). Tissue 
lymphocytes also stained positively. with the ami- sense probe, 
but the stromal cells did not. Expression of hTERT protein 
was observed by IHC in 52 of the 64 breast cancers. hTERT 
mRNA and protein expressions were highly correlated in 
bieast cancers (p<0.005). Detection of the hTERT protein by 
IHC has permitted further analysis of carcinogenesis and 
cancer diagnosis (34). 

In recent years, there has been disagreement over the 
suitability of telomerase activiiy as a prognostic biologic 
marker in breast cancer that may help to differentiate patients 
for individually based risk-related therapy. Hiyama et al (15). 
in a study of 140 breast cancer specimens with the TRAP 
assay, found a strong association between telomerase activity 
and stage classification and observed telomerase activity 
in 68% of stage 1 tumors and 95% of stage IV tumors. 
Poremba et al(\), using tissue microarrays, found a statistically 
significant correlation between tumoi-specific survival (overall 
survival) and hTERT expression in breast cancer. However, 
some problems in interpretation may affect this apparent 
consensus. First, some samples of breast cancer tissue may 
be extensively contaminated by infiltrating lymphocytes 
during operative manipulations, especially in advanced disease, 
causing overcstimarion of telomerase activity and/or hTERT 
expression, in our previous reports, higher expression levels 



of signals for both hTERT mRNA and protein were delected 
in some sections containing both carcinoma and lymphocytes 
in thyroid and colorectal cancers (13,24). Secondly, Poremba 
et al (1) used polyclonal antibodies against hTERT protein as 
a signal for expression. In our hands, polyclonal antibodies 
against hTERT protein give rise to strong background signals 
and are not clearly specific for measuring expression in cancer 
tissues. We have carefully compared the reactivity against 
hTERT protein of the monoclonal antibody used in the 
present study with that of some polyclonal antibodies. Use of 
the monoclonal antibody in IHC allowed clear demonstration 
of hTERT protein expression, with results similar to those 
of 1SH for hTERT mRNA expression. Furthermore, IHC is 
technically much easier to perform than 1SH, since con- 
tamination of samples by RNase is not an issue in IHC. 

To the best of pur knowledge, this report is the first on the 
study of hTERT expression in breast cancer as a function of 
neoadjuvant treatment. We examined hTERT expression 
in 4 cases of breast cancer before and after chemotherapy. 
Before chemotherapy, hTERT was strongly expressed in 
all 4 carcinomas, but after chemotherapy hTERT expression 
had completely disappeared in all 4 cases. hTERT expression 
by. lymphocyte's was detectable by JSH and JHC both before 
and after chemotherapy in all 4 cases, and the level of 
expression did not appear to be altered by treatment. 

In conclusion, determination of hTERT mRNA expression 
by 1SH and hTERT protein expression by IHC can be used to 
obtain information contributing to a histopathologic^! diagnosis 
during screening of breast cancers. By use of a monoclonal 
antibody, we could very clearly and sensitively demonstrate 
hTERT protein expression in breast cancer tissues but not in 
non-cancerous tissues. We also demonstrated that 4 carcinomas 
with originally positive immunoreactivity against hTERT 
protein became negative after neo-adjuvant chemotherapy. 
These results suggest ihat determination of hTERT protein 
by ICH can be used as a potential diagnostic and therapeutic 
marker to distinguish breast cancers. 
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Increased expression of proteasome subunits in skeletal muscle of cancer' 
patients with weight loss. 

jgialJ, Hiac_AV, Fearon KC , Dejong CH , Tisdale MJ 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham B4 7ET, UK. 

Atrophy of skeletal muscle is common in patients with cancer and results in increased 
morbidity and mortality. In order to design effective therapy the mechanism by which 
this occurs needs to be elucidated. Most studies suggest that the ubiquitin-proteasome 
proteolytic pathway is most important in intracellular proteolysis, although there have 
been no reports on the activity of this pathway in patients with different extents of weight 
loss. In this report the expression of the ubiquitin-proteasome pathway in rectus 
Abdominis muscle has been determined in cancer patients with weight loss of 0-34% 
usmg a competitive reverse transcriptase polymerase chain reaction to measure 
expression o f mRNA fpr proteasome subunits C2 and C5, while protein expression has 
been determmed by western blotting. Overall, both C2 and C5 gene expression was 
increased by about three-fold in skeletal muscle of cachectic cancer patients (average 
we.ght-loss 14.5*-2.5%), compared with that in patients without weight loss^Sor 
wthout cancer. The level of gene expression was dependent on the amount of weight 
oss mcreasmg maximally for both proteasome subunits in patients with weight loss of 
l^-iy A. Further increases m weight loss reduced expression of mRNA for both 

weiSo^ t™*' alth ° Ugh , it WaSSm e,CVated in with patients with no . 

• Zfff) / Th ^ n ° f ° r m inCfeaSe in **P^°n at weight losses less 

Inhnnl h?T7k S * g °° d ^ G]ation between expression of proteasome 20Sal P ha 
subunits, detected by western blotting, and C2 and C5 mRNA, showingthat increased 
gene express.on resulted in increased protein synthesis. Expression of the ubiquitin 
conjugating enzyme, E2(14k), with Weight Joss followed a similar pattern to (hat of 
proteasome subunits. These results suggest variations in the expression of key 
™ P °,T? ° f ^ ubic * u ! tin -P rotea s°me pathway with weight loss of cancer patients and 
suggest that another mechanism of protein degradation must be operative for patients 
with weight loss less than 10%. »vc inpatients 

PMID: 161251 16 [PubMed - in process] 
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Expression of the ubiquithi-proteasoiuc pathway and muscle loss in 
experimental cancer cachexia. 

KbalJ, Wykc SM, RusseU ST, ffiae AV. Tisdale MJ . 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham, UK. 

Muscle protein degradation is thought to play a major role in muscle atrophy in cancer 
cachexia. To uweshgate the importance of the ubiquitin-protcasome pathway, which has 
been suggested to be the main degradative pathway mediating progressive protein loss in 
cachexia, the expression of mRNA for proteasome subunits C2 and C5 as well as the 
ubiqu.tm-co.uugating enzyme, E2(l4k), has been determined in gastrocnemius and 
pectoral muscles of mice bearing the MAC 16 adenocarcinoma, using competitive 
juant.tat.ve reverse transcriptase polymerase chain reaction. Protein levels of proteasome 
suburuts and E2(14k) were determined by immunoblotting, to ensure changes I mR^T 
were reflected ,n changes in protein expression. Muscle weights correlated linearly with 
wetght loss dunng die course of the study. There was a good correlation between 
express.on of^and E2(l4k) mRNA and protein levels in gastrocnemius muscle with 
increases of 6-8-fold for C2 and two-fold for E2(14k) between 12 and 20% weightToss 
followed by a decrease in expression at weight losses of 25-27%, although loss of muscle 
protein continued. In contrast, expression of C5 mRNA only increased two-fold and was 
elevated s.nularly at all weight losses between 7.5 and 27%: Both proteasome functional 
activity, and proteasome-specific tyrosine release as a measure of total protein 
degradat,on was also maximal at 18-20% weight loss and decreased at higher weight loss 
Proteasome expression in pectoral muscle followed a different pattern with increases in ' 
C2 and C5 and E2(14k) mRNA only being seen at weight losses above 1 7% although 
muscle loss mcreased progressively with increasing weight loss. These results suggest 
tiiatact.vat.on of the ub.quitin-proteasome pathway plays a major role in protein toss in 
gastrocnemius muscle, up to 20% weight loss, but that other factors such as depression in 
protein synthesis may play a more important role at higher weight loss. 

PMID: 16160695 [PubMed - in process] 
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Cell type-specific occurrence of caveolin-lalpha and -lbeta in the lung 
caused by expression of distinct mRNAs. 

KogoH, Aiba T, Fujimoto T . 

Department of Anatomy and Molecular Cell Biology, Nagoya University Graduate 
School of Medicine, Showa-ku, Nagoya 466-8550, Japan. hkogo@fUjita-hu.ac.jp 

Two isofonns of caveolin-l, alpha and beta, had been thought to be generated by 
alternative translation initiation of an mRNA (FL mRNA), but we showed previously that 
a variant mRNA (5<V mRNA) encodes the beta isoform specifically. In the present sLy 
we demonstrated strong correlation between the expression of the caveolin-l protein 
isofonns and mRNA variants in culture cells and the developing mouse lung. The alpha 
isoform protein and FL mRNA were expressed constantly during the lung development 
wherea^expression of the beta isoform protein and 5'V mRNA was negligible in the fetal 
lung before 17.5 days post coitum, and markedly increased simultaneously at 18 5 days 
post cmtum, when the alveolar type I cells started to differentiate. Immunohistochemical 
analy Sls revealed the cell type-specific expression of the two isoforms; the alveolar type I 
cell expresses the beta isoform predominantly, while the endothelium harbors (he alpha" 
isoform chtefly. The mutually exclusive expression of caveolin-l isofonns was verified 
by Western blotting of the selective plasma membrane preparation obtained from the 
endothelial and alveolar epithelial cells. The present result indicates that the two 
caveolin-l isoforms are generated from distinct mRNAs in vivo and that their production 
is regulated independently at the transcriptional level. The result also suggests that the 
alpha and beta isoforms of caveolin-l may have unique physiological functions. 

PMID: 1 5067006 [PubMed - indexed for MEDLINE] 
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Oncogene and growth factor expression in ovarian cancer. 
KommossF, Bauknecht T, Birmclin G . KohlerM Tesch H , PfleMcrerA . 
Department of Gynaecology, AlberM-udwig University, Freiburg, Germany. 

"S^^hT^f ° l08iCal behaVi ° r ° f ° Varian P~babl y influences both 

t ZT£ ? "? rCSP ™ t0 chemothera Py> are the most relevant prognostic 
factor . The phenotype of different ovarian carcinomas is obviously associated Sh an 

^Tof^oTT 1 ^ s,8nai p f way ' including ™ d «»■ 

Analysis of EGF-R, TGF-alpha, c-myc. and c-jun expression in 33 stage HI/IV, and 2 
stage I/II ovarian carcinomas with biochemical, molecular-chemical and 
immunohistochemical methods showed a correlation between the mRNA and protein 
levels of EGF ; R and TGF-alpha for tumors with low or high expressing rates. However 
the concentration of measurable free EGF-Rs seems to depend on the amount of TGF- 
alpha expression by the tumors. The EGF-R binding ligand TGF-alpha is produced by 
ep.thehal tumor cells; stromal cells are usually TGF-alpha-negative, as shown by 
immunohistochemistry. High expression rates of EGF-R, TGF-alpha and c-myc were 
detected in 6, 7, and 10 out of 35 ovarian carcinomas, respectively. C-jun mRNA was 
detected m 18/19 cases studied. Non-maUgnant tissues originating from myometrium or " 
ovary expressed no (or only small amounts of) EGF-R or TGF-alpha mRNA, whereas a 
high c-myc expression was found in 1/7 normal myornetria, and in 2/5 normal ovaries 
1 here was no strong correlation between EGF-R/TGF-alpha and c-myc/c-iun 
expression.(ABSTRACT TRUNCATED AT 250 WORDS) 

PMID: 1502888 [PubMed - indexed for MEDLINE] 
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Somatostatin receptors in primary human breast cancer: quantitative 
analys.s of mRNA for subtypes 1--5 and correlation with receptor protein 
expression and tumor pathology. 

Ki"narJI,Grj^^ 

R^T^° rat u CS F0, ; D i abetes Research - Department of Medicine, McGill University 
Royal VictonaHospital, 687 Pine Avenue West, H3A IA1 Montreal, Quebec, Canada 
ujendra.launar@muhc.mcgill.ca 

Somatostatin receptors (SSTRs) have been identified in most hormone-producing tumors 
as well a? m breast cancer. In the present study, we determined SSTR1-5 expression in 
primary ductal NOS breast rumors through semi-quantitative RT-PCR and 
immunocytochemistry. The results from the analysis of 98 samples were correlated with 
several key histological markers and receptor expression. All five SSTR subtypes are 
variably expressed at the mRNA level in breast tumors with 91% of samples showing 
SSTR1, 98% SSTR2, 96% SSTR3, 76% SSTR4, and 54% SSTR5 SsTrI 5 « ^ 
localized to both tumor cells and the surrounding peritumoral regions as detected by 
immunocytochemistry. Levels of SSTR mRNA when corrected for beta-actin levels 
were h.ghest for SSTR3 followed by SSTR1, SSTR2, SSTR5, and SSTR4 Furthermore 
7o e o/% W ^f too^of^ 0 " bCtWeen mRNA 311(1 P rotein e *P«*sion with 84% for SSTRl', 
f" 89% f ° r SSTR3 ' 68% for SSTR4 > 68% forSSTRS, and 78% for all five 

receptors bSTRl,2 and 4 were correlated with ER levels whereas SSTR2 showed an 
addihona correlation with PR levels. These correlations were independent of patient age 
and histological grade. Moreover, using immunocytochemistry, blood vessels exhibited 
receptor-specific localization for SSTR2 and SSTRS. Our results indicate significant 
corrections between mRNA and protein expression along with receptor-specific 
correlations with histological markers as well as ER and PR levels. Differential 
distribution of SSTR subtypes in tumors and receptor-specific expression in vascular 
agonisT C ° nSidered 83 3 noveI di *&™* f°r breast tumors with receptor subtype 
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A transcriptomic and proteomic analysis of the effect of CpG-ODN on human 
THP-1 monocytic leukemia cells. 

ICuojCC, Kuo_C_W, LiMgJCM, LiMg_SM. 

Institute of BioAgricultural Sciences, Academia Sinica, Taipei, Taiwan. 

The CpG motif of bacterial DNA (CpG-DNA) is a potent immunostimulating agent whose 
mechanism of action is not yet clear. Here, we used both DNA microarray and proteomic 
approaches .to investigate the effects of oligodeoxynucleotides containing the CpG motif (CpG- 
ODN) on gene transcription and protein expression profiles of CpG-ODN responsive THP-1 cells 
=Microarray analysis revealed that 2 h stimulation with CpG-ODN up-regulated 50 genes and 
down-regulated five genes. These genes were identified as being associated with inflammation 
antimicrobial defense, transcriptional regulation, signal transduction, tumor progression cell ' 
differentiation, proteolysis and metabolism. Longer stimulation (8 h) with CpG-ODN enhanced 
transcriptional expression of 58 genes. Among these 58 genes, none except one, namely WNTI 
inducible signaling pathway protein 2, was the same as those induced after 2 h stimulation 
Proteomic analysis by two-dimensional gel electrophoresis, followed by mass spectrometry 
identified several proteins up-regulated by CpG-ODN. These proteins included heat shock 
proteins, modulators of inflammation, metabolic proteins and energy pathway proteins 
Comparison of microarray and proteomic expression profiles showed poor correlation Use of 
more reliable and sensitive analyses, such as reverse transcriptase polymerase chain reaction 
Western blotting and functional assays, on several genes and proteins, nonetheless, confirmed that 
there is indeed good correlation between mRNA and protein expression after CpG-ODN 
treatment. This study also revealed that several anti-apoptotic and neuroprotective related 
proteins not previously reported, are activated by CpG-DNA. These findings have extended our 
knowledge on the activation of cells by CpG-DNA and may contribute to further understanding of 
mechanisms that link innate immunity with acquired immune response(s). 

PMID: 15693060 [PubMed - indexed for MEDLINE] 
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A transcriptomic and proteomic analysis of the effect of 
CpG-ODN on human THP-1 monocytic leukemia cells 
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1 Introduction 

Mammals protect themselves against pathogen infection pri- 
manJy via innate and adaptive immunity [1J. Hie innate 
immune system relies on a set of pattern recognition receptors 
{e.g., Toll-like receptors) to recognize foreign molecular struc- 
tures such as lipopolysaccharide (LPS) and bacterial DNA |2, 3]. 
Innate immune cells recognize these molecular structures and 
initiate not only innate but also adaptive immunity by produc- 
ing immunomodulatory cytokines and activating T and B 
immune cells [1]. Bacterial DNA can directly activate B cells to 
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proliferate and secrete immunoglobuliris in a T cell-independ- 
ent manner [4-6). It also induces B cells and monocytes to 
activate transcription factor NF-kB and secrete cytokines, 
including interleukrn (11) 12, tumor necrosis factor a (TNF-a), 
and interferon a/p [7-10]. The immunostimulatory activity of 
bacterial DNA has been assigned tounmethyiated CpG motifs 
(GACGTTfor murine, GTCGTT for human) [11). Recent evi- 
dence shows that synthetic oligodeoxynudeotides containing a 
CpG motif (CpG-ODN), like bacterial DNA with the CpG moi- 
ety (CpG- DNA), induce potent Thl-like immune responses 
that are protective against several infectious agents and 
immune disorders inarrimal models [1 2, 1 3], Biologically active 
CpG-ODN, like bacteria DNA, activates macrophages and 
immature dendritic cells to increase expression of MHC class 
11 and cosrimulatory molecules, thereby transcribing cytokine 
rnRNAs, and producing pro-inflammatory cytokines including 
TNFa,.IL-l, 1L-6 and 11-12 [% 14-16). CpG-ODN can therefore 
serve as an adjuvant and irnmunomodulator in vaccines 
against a wide variety of targets, including infectious agents, 
cancer antigens and allergens [17). 

It has been suggested that unmethylated CpG-DNA-medi- 
ated immune activation functions through a toll-like receptor 9 
(TLR9) signaling pathway (18). Endocytosis and sequentially 
endosomal maturation as weD as binding of heat shock protein 
(HSP) 90 to CpG-DNA are essential for induction of TLR9 sig- 
nal transduction [19, 20). ]t has also been shown that recogni- 
tion of CpG-DNA causes TLR to form a dimer, which recruits 
the adaptor molecule, myeloid differentiation factor 88 
(MyD88), through interaction between their C-teirninal ToB/IJL- 

1 R domains. This recruitment of MyD88 to the Toll/1 L-1R do- 
main of TLR9 initiates a signaling pathway that sequentially 
involves IL-lR-associated kinase 1 and TNF-a receptor-asso- 
ciated factor 6 [18, 21, 22). Studies using gene-deficient mice 
and RAW2647 cells transiently transfected with dominant- 
negative forms of these molecules have indicated that the 
MyD88-mediated signaling pathway is essential for CpG-DN A- 
induced activation of NF-kB and c-Jun NH2-terminal kinase 
(JNK), as well as subsequent production of cytokines in mono- 
cytic cells [1 8, 2J , 22]. The precise mechanism of action of CpG^ 
DNA and CpG-ODN, nonetheless, is still not thoroughly 
understood. To further elucidate the molecular events after 
binding of CpG-ODN to TLR9, in this study, we treated CpG- 
ODN responsive THP-1 cells with CpG-ODN and evaluated 
changes by using DNA microarray and proteomic approaches. 
We have discovered up- regulation of more than 50 disting- 
uished genes/proteins and identified induction of several anti- 
apoptotic and neuroprotecting genes by CpG-ODN treatment. 

2 Materials and methods 
2.1 Reagents 

Phosphorothioate-modified CpG-ODN and GpC-ODN were 
synthesized by MDBio (Taipei. Taiwan). Human specific 
ODN sequences are: CpG-ODN, 5'-TCC TCG TIT TGT CGT 
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TTT GTC GTT-3'; GpC-ODN, 5' TGC TGC TIT TGT GCT 
TIT GTG CTT-3'. The mouse specific CpG-ODN sequence is 
5'-TCC ATG ACG TTC CTG ATG CT-3\ CHCA was from 
Sigma (St Louis, MO, USA). 

2:2 Cell culture 

Cell lines were obtained from the American Type Culture 
Collection (Rockville, MD). Mouse RAW264.7 macrophage 
and human embryonic kidney 293 cells (HEK293) were cul- 
tured in DMEM- supplemented with 10% heat inactivated 
fetal bovine serum, 100 U/mL penicillin, 100 pg/mL strep- 
tomycin sulfate, 200mmol/L L-glutarrune, and 50 um 0-mer- 
captoethanol in a humidified atmosphere of 5% C0 2 at 37°C. 
The medium was changed every 2 days for all experiments. 
Human THP-1 monocytic leukemia cells, which have been 
shown, to express TLR9 and respond to CpG-DNA stimula- 
tion [23, 24J, were cultured in RPMI1640 with the same sup- 
plements as for RAW264.7 cell cultures. 

2.3 Human cDNA microarray 

Total RNAs extracted from cultured THP-1 cells were isolated 
with TRlzoI (Invirrogen, Leek, The Netherlands) and submitted 
to Genasia Biotechnology (Taipei, Taiwan) for further proces- 
sing. In brief, 4 ug of total RNA from CpG-ODN stimulated, or 
normal THP-1 cells was labeled with a fluorescence marker 
(U-vision F (Taipei, Taiwan)). Different colored fluorescence dyes 
(Cy5 and Cy3) were used to distinguish total RNA from normal 
and ODN stimulated cells. The labeled RNA was used for 
hybridization with the Human 1 cDNA microchip from Agilent 
Technologies (Palo Alto, CA, USA). The chips were scanned and 
the expression pattern was analyzed using genechip software. 
Genes showing up-regulation or down-regulation of RNA levels 
were analyzed and identified on a genomic database as sug- 
gested by the manufacturer of the microchip. 

2.4 Protein preparation 

THP-1 cells were seeded in a 175 cm 2 tissue culture flask at a 
density of 10 6 cells per milliliter in culture medium. The cells 
were stimulated with or without 1.5 um CpG-ODN at defined 
times and harvested by centrifugation at 4°C, 1000 x g for 
I5min. Cell pellets were washed twice with ice-cold PBS, 
resuspended and sonicated in extraction buffer containing 
25 mM Tris-Ha (pH 7.5), 2mM p-mercaptoethanol and pro- 
tease inhibitor cocktail. After centrifugation at 10000 x g for 
20min, ammonium sulfate was added to the supernatant until 
the final concentr ation reached 50% saturation w/v. The solu- 
tion was stirred at 4°C for 30 min and centrifuged at 10 000 x g 
for 30 min at 4°C. The supernatant fraction was then trans- 
ferred into a fresh rube, and the precipitated protein pellet 
solubilized in extraction buffer. To remove salts and other con- 
taminants, the extracts were treated with a pre-cooled (-20°C) 
solution of 10% TCA in acetone with 0.07% p-mercaptoetha- 
nol. Proteins were allowed to precipitate overnight at -20*C 



© 2005 WILEY-VCH Verlag GmbH & Co. KGaA, Weinbeim 



www.proteomics-journal.de 



896 C-C.KuoetaL 

After centrifugation, the pellet was washed with ice-cold ace- 
tone, containing 0.07% P-mercaptoethanol. The supernatant 
was discarded and the peDet dried in a SpeedVac system {Model 
AES1010: Savant, Holbrook, NY, USA). 

2.5 2-DE 

2-DE was performed using an IPGphore IEF and a Hofer 
DA1X vertical unit (Amersham Biosciences, PiscatawaVj. NJ, 
USA). One milligram of dried protein sample was dissolved 
in 350 uL of rehydration buffer solution, containing 7 m urea, 
2 m thiourea, 4% w/v CHAPS, 5 mM tributyi phosphine, and 
2% IPG and loaded onto an immobilized pH 3-10 linear 
gradient strip (18 cm), followed by rehydration for 16h. IEF 
was then performed in the following manner: 100 V for 
30min, 250 V for 30min, 500V for 30mm, 3000 V for 
30 min, 4000 V for 30min, 6000 V for 55 000 Vh. At the end 
of IEF, the IPG strips were equilibrated for 15 min in buffer 
containing 6 m urea, 2% w/v SPS, 30%v/v glycerol, and 
50 mM Tris, pH 6.8, then reduced with 65 mM dithiorery- 
thritol (DTE) and subsequently alleviated with 135mM 
iodoacetamide for another 15 mm. After equilibration, the 
IPG strips were immediately placed on top of a 12% SDS^ 
PAGE (1.5 mm;. 20x24 cm). The second dimension gels 
were then overlaid with molten 0.8% agarose solution in 
SDS electrophoresis bufTer. Electrophoresis was performed 
at 16°C, starting at 10mA per gel for 1 h, followed by 45 mA 
per gel until the dye front reached the bottom of the gels. 

2.6 Staining and image acquisition 

Immediately after electrophoresis, gels were stained with 
SYPRO Ruby (Molecular Probes, Eugene, OR, USA). In 
brief, gels were fixed for 30min in 10% methanol, 7% acetic 
acid, and then stained overnight in SYPRO Ruby stain. The 
staining solution was removed and gels were washed in 10% 
methanol and 7% acetic acid for 3h. After staining, image 
acquisition was carried out on a Typhoon 9200 (Amersham 
Biosciences). To identify a protein, spot detection, quantifi- 
cation and matching of 2-D results were analyzed using 
JmageMaster software (Amersham Biosciences). The M 1 of 
the proteins were calibrated according to the LMW-SDS 
Marker Kit (Amersham Biosciences), and their pj values 
were estimated from the position of the protein spots on the 
2-D gel and confirmed with the information supplied by the 
manufacturer. Since most of the pi values for the tnmcated 
proteins had not been reported previously, the p/ values of 
the truncated proteins were estimated from the position of 
the observed spots. To omit the variation due to the use of 
separate gels, afler background subtraction, the intensity 
levels of protein spots on each geJ were normalized as a pro- 
portion of one reference spot, and protein quantities were 
calculated by integrating the density over the spot area. Pro- 
tein spots that showed reproducible modulation exceeding 
-80% after CpG-ODN treatment in thiee experiments were 
further analyzed by MS. 
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2 J In-gel digestion with trypsin arid extraction of 
peptides 

Protein spots were excised from stained gels and cut into 
pieces. In brief, gel spots were dehydrated with ACN for 
10 min and dried in a vacuum centrifuge. Gel pieces were 
reswelled with 55 mM DTE in 25 mM ammonium bicarbon- 
ate (pH 8.5) at 37° C for 1 h. The solution was then exchanged 
with alkylation solution, which contained lOOrriM iodoaceta- 
mide in 25 mM ammonium bicarbonate (pH 8.5), at room 
temperature for 1 h. After- alleviation, the gel pieces were 
washed twice with 50% ACN in 25 mM ammonium bicarbo- 
nate (pH 8.5) for 15 mm. The wash solution was discarded 
and the pieces of gel were dehydrated with ACN for 10 min 
and dried in a vacuum centrifuge. Tryptic digestion was 
initiated by resweiling the gel in 25 mw ammonium bicar- 
bonate solution with 25 ng of trypsin (Promega, Madison, 
WI, USA). After incubation at 37°C for 16h, tryptic peptides 
were extracted twice with 50% ACN containing 5% formic 
acid for 15 min with moderate sonication. The extracted 
solutions were pooled and evaporated to dryness in a vacuum 
centrifuge. The dried peptide mixture was dissolved in 0,1% 
formic acid and used for MS. 

2.8 MALDI-Q-TOF MS and protein identification 

Tryptic peptides analyses were performed using a Micromass 
Q-TOF Ultima MALDI (Micromass, Wythenshawe. U.K.) 
equipped with a 337 nm nitrogen laser and operated in 
reflectron positive ion mode. Peptide mixtures (1 uL) were 
premixed with 1 uL of the matrix (5mg CHCA in 50% ACN 
with 0.1% TFA) then spotted onto the MALDI target plate. 
Mass spectra were acquired for the mass range of 900- 
3500 Da and the individual spectra from MALDI MS or MS/ 
MS were processed using the Micromass MassLynx 4.0 soft- 
ware. The generated peak list files were used to query the 
Swiss-Prot database using the MASCOT program (http:// 
www.matrixscience.com) with the following parameters: 
peptide mass tolerance, 50ppm; MS/MS ion mass tolerance, 
0.25 Da; allowance of missed cleavage, 1; and consideration 
for variable modifications such as oxidation of methionine 
and carboxyamidomethylation of cysteines. Only significant 
hits as defined by MASCOT probability analysis were con- 
sidered initially. In addition, when the PMF matches were 
between 5 and 9, at least one peptide sequence was manually 
checked by MALDI MS/MS analysis. 

2.9 RTPCR analysis 

cDNA from THP-1 cells was produced with Superscript II 
reverse transcriptase (Invitrogen) using a o!igo(dT) )5 primer 
for 1 h at 42°C PCR of cDNA was performed using specific 
primers for the gene of interest and control P-actin. All PCR 
products were electrophoresed on a 1.5% agarose gel, and 
DNA bands weie -visualized by staining the gel with ethi- 
dium bromide. 
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2.10 Immunoblotting 



3 Results 



Human THP-1 or mouse macrophage RAW264.7 cells 
(5 x 3 x 10 6 /well) were cultured in a six-well culture 
plate and treated with or without 1.5 um CpG-ODN for 
the. designated times. After stimulation, cells were harvest- 
ed by centrifugation at 1000 x g for 15miii in a refhV 
gerated centrifuge and washed twice with cold PBS buffer 
The cells were lysed on ice for ISmin with 300 uL lysis 
buffer (Pierce, Rockford, USA), supplemented with pro- 
tease inhibitor cocktail (Sigma). The lysates were cen- 
trifuged at 12000 x g for 15 nun at 4°C, and protein 
concentrations of supernatant were determined using the 
Bio-Rad Protein Assay (Hercules, CA> USA). The lysates 
(50 ug of protein/lane) were subjected to 12% SDS-PAGE 
and transferred to NC membranes (Amersham Bio- 
sciences). The membranes were blocked in PBS-0.1% 
Tween 20 (PBST) containing 5% non-fat skim milk at 
room temperature for 1 h, followed by staining with anti- 
ADP-ribosylation factor 3 (ARF-3) monoclonal antibody. 
(0-1 Mg/ mL ; Sigma). The membranes were then incubated 
with horseradish peroxidase-conjugated secondary anti- 
body (dilution, 1:3000) for 1 h. After washing three times 
with PBST, specific bands were detected by chemilumi- 
nescence according to the manufacturer's protocol 
(Amersham Biosciences). 

2.11 Cell transfection and lucif erase assay 

HEK293 cells (5 x 3 x 10 6 /well) were transfected using 
FuGENE 6 (Roche Molecular Biochemicals, Indianapolis, 
IN, USA) plus 0.1 ug p5xNF-KB-]uc (Stratagene, La Jolla, CA, 
USA), 0.05 ug pCDNA3.1-P-galactosidase, and pCDNA3.1- 
hTLR9 overnight The cells were incubated with or without 
1.5 um CpG-ODN for 8h and then Jysed. NF-kB luciferase 
activity assays were performed as recommended by the 
manufacturer (Promega). P-galactosidase activity was used to 
normalize the data. 

2.12 Enzyme activity assay 

Pyruvate kinase activity was assayed in a solution (1 mL) 
containing ]00mM Tris-HCl (pH 8.0), 100 mM KCl, 10 rnM 
MgCl 2 , 0.2 mM NADH, 10mM PEP, 1.5 mM ADP, lunit of 
lactate dehydrogenase, and an appropriate amount of cell 
lysate from CpG-ODN untreated or treated THP-1 cells. The 
reaction was monitored at 30°C for a period of time by 
measuring the decrease in absorbance at 340 nm. PGK ac- 
tivity was assayed in a coupled reaction with glycer aldehyde 
3-pbosphate dehydrogenase (GAPDH) as described by Lee 
[25J. In brief, the assay was performed at 30°C in a total vol- 
ume of ] mL containing JOOmM Tris-HCl (pH 7.9), 10mM 
MgCl 2 , 0.15 mM NADH, 2mM ATP, 6mM 3-phosphoglyce- 
rate, 0.1 mg/mL BSA, 50mg of GAPDH, and an appropriate 
amount of cell lysate. NADH consumption was monitored at 
340 nm. 



3.1 Effect of CpG-ODN on gene expression profiles of 
human THP-1 cells 

To elucidate the effect of CpG-ODN on gene expression, 
THP-1, a cell line known to express TLR9 and respond to 
CpG-DNA (23, 24) was cultured with or without CpG- 
ODN. Since preliminary experiments showed that 1.5 um 
CpG-ODN caused more contrasting results between nor- 
mal and CpG-ODN. treated cells, 1.5 um CpG-ODN was 
used throughout the experiments. To distinguish CpG- 
ODN treated samples from the control, total cellular RNA 
of normal and CpG-ODN treated cells was isolated and 
labeled with the fluorescence dyes, Cy5 and Cy3, respec- 
tively. The labeled RNA was then used for hybridization 
with a Human 1 cDNA microchip from Agilent Technol- 
ogies. Of the 13 000 human genes represented on the 
gene array, a total of .55 genes changed expression sign- 
ificantly after 2h CpG-ODN treatment Among these, 
50 genes were up-regulated while five genes were down- 
regulated by a factor > 2. These genes were sorted by 
functions and are listed in Tables 1 and 2. They included 
notably, 1L-18 receptor accessory protein, MSG A beta 
gene, thioredoxin, pro-pol-dUTPase polyprotein, Spl40, 
cpnnexin 59 gene, Grb2-like 2, enoyl-coenzyme A hydra- 
tase, propionyl coenzyme A carboxylase, cytochrome P450, 
and WNT1 inducible signaling pathway protein 2 (WISP- 
2) etc. The function of these genes are known to be relat- 
ed to inflammatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduc- 
tion, tumor progression, cell differentiation, proteolysis 
etc. 

Table 1. Genes up-regulated t> 2 fold) in human THP-1 cells after 
2 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

IL-1 8 receptor accessory protein AF077346 

T cell receptor V beta gene X58806 

MGSA, beta gene U03019 

Antigen gene (PA) M21896 

TIED NM_004791 

Platelet activating receptor AF002986 

Antimicrobial defence 

Thioredoxin NM_003329 

Pro-Pol-dUTPase polyprotein AC004748 

Nuclear body protein Spl40 U63420 

Transcriptional regulation 

Putative transcription factor LUZP AI986271 
General transcription factor II, I, pseudogene 1 AI700706 

Connexin 59 gene L29277 

Basic helbc-loop-helix protein class B 1 AF221520 
(BHLHBt) 
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Table 1. Continued 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Regulatory protein 
Advillin 

i 

Channel and transport . 

Small GTP binding protein Rab9 
FXYD domain-containing ion transport 
regulator 1 

Signal transduction 

SH3-ddmain Grb2-like 2 
Titin 

Tittn associated protein (165 kD protein) 
KIAA1451 protein 

Vasoactive intestinal peptide receptor 

Enzyme and protease 

Enoyl-Coenzyme A, hydratase 

Nephrin, B-type metallocarboxypeptidase 

Propionyl Coenzyme A carboxylase 

Acyloxyacyl hydrolase (neutrophil) 

Cytochrome P450 

Intestinal alkaline phosphatase 

Tumor progression and cell differentiation 

. Retinoblastoma 1 - 
Human genomic DNA of 9q32 anti-oncogene 
of flat Epithenurrt cancer, 
segment6/1u' 
WNT1 inducible signaling pathway protein 2 
(WISP-2) 

Structure protein 

Collagen, type IV, alpha 6 
Beta Myosin heavy chain 

Other 

Homo sapiens Cri-du-chat region mRNA, 

clone NIBB11 
Human mRNA for laminin alpha 5 chain, 

partial cds. 
NIK like and Thyroxin-binding 

globulin precursor 
Hypothetical protein DKFZp434M0331 
Hypothetical protein FLJ1 1021 similar > 

to splicing factor 
Hypothetical protein 
Chromosome 18 open reading frame 1 
Arfaptin 1 

Zinc finger protein 8 (ZFP8) 

Zinc finger protein 137 (clone pHZ-30) 

Olfactomedin related FR localized protein 

Cyclin-dependent kinase 8 

Integrin, alpha 1 

KIAA0421 protein 

KIAA1233 protein 

Unnamed protein product 

NBL4 

BC331191J1 



AF041449' 

U44103 
AM 25364 



AF036268 

X90568 

X69089 

AB040884 

U11087 



AI800553 

U65090 

AB011145 

M62840 

U79716 

M31008 



L11910 

AB036268 



AF 100780 



D21337 
M58018 

U52827 

AB010099 

Z83850 

AL137720 
AK023985 

AL049851 

NM_004338 

AW4087S5 

M29581 

U09414 

AI738468 

BE467537 

D87462 

AB007881 

AB033059 

AK026362 

X75535 

AAD39268 
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Table 2. Genes down regulated (> 2 fold) in human THP-1 cells 
after 2 h of CpG-ODN treatment 



Gene name and description 


Genebank 


(changed fold > 2) 


number 


Zinc-finger homeodomain protein 4 


BAB03600 


Human protein kinase MEKK2b mRNA, 


AF239798 


complete cds. . 




Glypicari 5 


U66033 


Human genomic DNA, chromosome 22q11.2, 


AP000362 


clone N75A12. 




Collagen, type 1, alpha 1 


Z74615 



ioriger stimulation of THP-1 cells with CpG-ODN (8h) 
resulted in the up-regulation of 58 genes. These genes in- 
cluded notably 1L-10 receptor beta, formyl peptide receptor- 
like 1 (FPRL1), vitamin D receptor; nuclear receptor sub- 
family 1 (bcR), early B-ceJl factor, protein kinase. C gamma 
(PKCJ, Nek, Ash, phospholipase C binding protein (NAP4), 
phosphoriboxyl pyrophosphate amidotransferase/ dishev- 
eled 3, WISP-2 etc. Analysis of the functions of the 58 up- 
regulated genes showed that they are associated with anti- 
inflammation, transcriptional regulation, intracellular signal 
transduction, tumor progression, cell differentiation, pro- 
teolysis, neurodegeneration, neuroprotection etc. (Table 3). 
We also found that the stimulation of THP-1 cells with CpG- 
ODN for different periods of times resulted in different pro- 
files. Several defense related genes such as 1L-18 receptor 
accessory protein, Pro-Pol-dUTPase polyprotein, Spl40 and 
connexin 59 were transiently up-regulated at 2 h short stira- 

Table 3. Genes up-regulated (> 2 fold) in human THP-1 cells after 
8 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

IL-10 receptor (beta) - U08988 

Formyl peptide receptor-like 1 (FPRL1) AF081535 

Vitamin D receptor J03258 

NMDAR1 Z32774 

CD44 antigen AW028346 

Nuclear receptor subfamily 1 (LxR> NM_005693 

Neuromedin B receptor M73482 

Transcriptional regulation 

Early B-cell factor (ebf) AF208502 

Neurogenic differentiation 1 INeuro D) AB018693 

MAX dimerization protein (NESH protein) AB037 886 

Ribosomnl protein S6 kinase AF090421 

ASH2L AB022785 

Regulatory protein 

Hypothetical protein DKFZp434H082Q AL137555 

Peroxisomal farnesylated protein X75535 

LJ BP4 * AF051344 

Neuronal pentraxin II U29195 
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Table 3. Continued 



Gene name and description 
(changed fold > 2) 



Structure protein 

Trichohyalin 

Kertain 

Other 

Human transferrin pseudogene 
TIMP-2 

Collagen like protein 

Human genomic DNA, chromosome 21q, 

section 60/105 
Human genomic DNA, chromosome 21q, 

section 64/105 
KIAA0136 
KIAA0379 
KIAA04B9 
KIAA1114 
KIAA1451 
KIAA0756 

Zinc finger protein 267 

Hypothetical protein FU 10633 

Hypothetical protein EUROIMAGE 1955967 

Mybl homoiog like 1 

Antizyme inhibitor 

Disintegrin-like and metalloprotease 

Ireprolysin type) with Thrombospondin 

type 1 motif, 3 
ADP-ribosylation factor 3 {ARF-3J 
Testis specific protein, Y-linked 
Unnamed protein 



Genebank 
number 



Table 4. List of antimicrobial and anti-inflammatory genes 
modulated by CpG-ODN treatment of THP-1 cells 



Channel and transport 

Gamma-aminobutyric [61 ] A receptor 
ATP synthase subunit F6 
Transient receptor potential channel 1 
Choroideremia (Rab escort protein 1) 

Signal transduction 

Protein kinase C, gamma 
Regulator of G-protein signaling 5 
Nek, Ash and phospho lipase C binding . 

protein INAP4) 
Highly similar to adeylate kinase gene 

Enzyme and protease 
Phenylalanine hydroxylase 
Cor boxy peptidase A 1 
Xylulokinase 
Pancreatic lipase 
Ubiquitin specific protease 12 
Transmembrane protease, serine2 
Aspartate bata-hydroxylase 
Phospho ribosyl pyrophosphate 
amidotransferase 

Tumor progression and cell differentiation 

CDC23 
WISP-2 

Microseminoprotein, beta 
Dishevelled 3 



NM_004961 
M37104 
Z73903 
X57637 

Z15114 

AI674877 

AB005216 

AB016886 

AA203389 

X67318 

AK001205 

J05125 

AF022789 . 

U75329 

U03109 

D13757 



AF053977 
AF100780 
M34376 
NM_004423 

L09190 
AF061809 

M22376 
U44383 
U67921 
AP001716 

AP001720 

D50926 

AB002377 

AB007958 

AL049732 

AB040884 

AB018299 

AF220492 

AK001495 

AK026108 

AK001893 

D88674 

AB002364 



M74491 
M98525 
AK026042 



Gene name 



Genebank 
number 



Expression fold 









2h 


8h 


Connexin 59 gene 


L29277 


2.12 


±0.05 


1.74 ± 0.08 


IL-18 receptor 


X58806 


2.32 


±0.21 


1.19 ±0.13 


accessory protein 










Integrin, alpha 1 


X68742 


2.01 


±0.03 


1.05 ±0.10 


Nuclear body protein 


U63420 


2.22 


±0.11 


1.39 ±0.04 


Sp140 










Pro-Pol-dUTPase 


AC004748 


2.33 


±0.18 


0.95 ± 0.03 


polyprotein 










Thioredoxin 


NM_003329 


2.20 


±0.08 


1.07 ± 0.01 


FPRLt 


AF081535 


0.95 


± 0.06 


2.13 ±0.11 


IL-10 receptor 


U08988 


1.22 


±0.18 


2.21 ± 0.07 


bcR 


NM_005693 


0.90 


±0.03 


2.37 ±0.31 


Vitamin D receptor 


J03258 


1.37 


±0.23 


2.39 ±0.11 



Expression fold is designated as the ratio of CpG-ODN treated 
over control 



uJation but were down- regulated thereafter, while anti- 
inflammatory associated genes such as FPRU, 1U0 recep- 
tor, vitamin D receptor and IxR were up- regulated after 8h 
stimulation (Tables 1, 3 and 4). 

3.2 Verification of the microarray results with RT-PCR 
or Western blotting 

To verify the results from the microarray analysis, we aJso 
performed RT-PCR on the up-regulated genes {Fig. 3). Con- 
sistent with results obtained in the micToarray gene expres- 
sion analysis, RT-PCR studies showed that the mRNA levels 
of some selected genes, including ubiquitin specific pro- 
tease 12, regulator of G-protein signaling 5, NAP4 and 
ASH2L, were increased in response to CpG-ODN (Fig. 1, 
Table 3). In addition, the protein expression level of ARF-3 

- GpC-ODN CpG-ODN 



Untreated 



Ubiquitin specific 
protease t2 | 
Regulator of G- J 
protein signaling 5 I 



S3EBB 

Figure 1. Induction of various genes by CpG-ODN. THPl cells 
were stimulated with medium alone, 1.5 jjm GpC-ODN {as the 
negative control) or CpG-ODN for the indicated times. -RT-PCR 
was then performed to analyze gene expression levels. 0-actin 
was used as an internal control. The experiment was repeated 
three times with similar results. 
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Figure 2. CpG-ODN induced ABF-3 protein expression in human 
THP-1 or mouse RAW264.7 cells. THP-1 {A) or mouse RAW264.7 
(B) cells were incubated with 1.5 \m CpG-ODN for the indicated 
time points. The protein expression level of ARF-3 was deter- 
mined by Western blotting of cell extracts using anti-ARF-3 anti- 
body. The experiment was repeated three times with similar 
results. 

was shown to increase in Western blotting analysis in cell 
lysates from THP-3 cells treated with CpG-ODN for 8-24 h 
(Fig. 2A). Similar studies showed that the ARF-3 protein was 
also induced by mouse specific CpG-ODN in other TLR9 
expression cell lines such as the mouse macrophage 
RAW264.-7 cell line (Fig 2B). 

3.3 Proteins regulated in CpG-ODN stimulated THP-1 
cells 

To further assess whether there was any correlation between 
regulation of gene expression and expression of cellular pro- 
teins, a proteomic approach was adopted to identify protein 
expression profiles. THP-1 cells were treated with CpG-ODN 
for defined rimes (from 8 to 40h), and their cytoplasmic 
proteins were extracted for 2-DE analysis. Although the use 
of high concentrations of urea might give us a broader view 
of aD the proteins afTected by CpG-ODN, preliminary results 
from 2-D gels showed that the resolution of the protein mix- 
tures were not satisfactory. To improve and get the.best reso- 
lution from 2-DE, total proteins were roughly separated into 
supernatant and precipitated fractions using 50% saturated 
ammonium sulfate solution. To remove salts and other con- 
taminants, both protein fractions were precipitated with TCA 
solution and then subjected to 2-DE. By protein spot deter- 
mination analysis, about 500 and 450 well-resolved spots 
were observed on each pH 3.0-10.0 gel for precipitated or 
supernatant fractions, respectively. Comparative analysis of 
2-DE between treatments and control showed that the inten- 
sities of the protein spots from the ammonium sulfate pre- 
cipitated fraction did not change, while several protein spots 
were up-regulated by at least -80% in the supernatant frac- 
tion of 8h CpG-ODN stimulated THP-1 cells (Fig. 3). 

The protein spots were individually excised from gels for 
further identification. After trypsin digestion, several protein 
spots were identified without ambiguity by MS MALD1- 
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n 5 ore 3. The effect of CpG-ODN on the 2-DE profile of THP-1 
cells. THP-1 cells were treated with or without 1.5 mm, CpG-ODN 
for 8h. Total proteins were extracted and roughly separated into 
two fractions by 50% saturation with ammonium sulfate. The 
supernatant fraction was then separated by 2-DE. Protein spots 
were visualized by SYPRO Ruby staining. Comparison of CpG- 
ODN treated THP-1 cells to untreated cells showed that these 
proteins changed in intensity by over 80%. Protein spots were 
identified by trypsin digestion and MS. Localization of protein 
spots 1 (enoly-coehzyme A hydratase), 2 {proteasome a) and 3 
Icyclophilin A; two isoforms) are shown. The experiment was 
repeated three times with similar results. 

TOF. These proteins included HSP60, HSP90, cydophi- 
linA, enoyl-coenzyme A hydratase, eukaryotic translation 
elongation factor, proteasome a and p* chain and ATP syn- 
thase beta chain (Table 5). Similar experiments on cells 
treated for a longer period of time with CpG-ODN stimu- 
lation (25 h) revealed tfrat 27 protein spots were changed in 
intensity by at least -80%. These protein spots contained 
members of HSPs (HSP27, hsc70, grp78 and grp94), 
metabolic enzymes (phosphoglycerate kinase (PGK) and 
pyruvate kinase (PYK)), macrophage capping protein and 
cyclophilin A (Table 6). Among these proteins, macrophage 
capping protein, PGK, PYK, cyclophilin A and HSP27 
(Figs. 4 and 5 A) were found to be up-regulated. Interest- 
ingly, we found that a truncated form of grp78 with an 
expected mass of 25kDa and pi of 5.3 was up-regulated 
while grp78 itself was down-regulated. A similar situation 
was also found foi grp94 and hsc70 and their truncated 
derivatives (Table 7 and Fig. 5). In addition, we also ob- 
served six down-regulated protein spots on 2-D gels in 
samples after 25 h CpG-ODN treatment. Among these six 
proteins, we have successfully identified three as 40s ribo- 
somal protein SA, grp78 and hsc70/ respectively (Table 6), 
while the other three, due to their relative low abundance, 
have not been identified yet. 
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Table 5. List of proteins modulated by 8 h CpG-ODN treatment 



Protein name 


Accession 


Mr' . 




Matched 


Cover- 


Score 


Expression 




no. 


(theor.) 


(theor.) 


no. 


age% 




fold 


ATP synthase beta-chain 


gi1 14549 


56 525 


5.26 


16 


58 


171 


2.33 ± 0.06 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 , 


62 


2.85 ± 0.13 


Enoyl-Coenzyme A 


g»4503447 a > r 35 971 


6.61 


11 


44 


62 


2.52 ±0.05 


hydratase 
















Eukoryotic translation 


gi4503481 


50 087 


6.25 


6 


25 


68 


3.41 ± 0.21 


elongation factor 
















HSP60 


P10809 w 


57 963 


5.24 


13 


27 


76 


2.78 ± 0.03 


HSP90-beta 


P08238 


83 133 


4.97 


10 


18 


65 - 


2.36 ±0.10 


Prbteasome a chain 


gi4506181 


25 882 


6.92 


11 


59 


80 


2.52 ± 0.11 


Proteasome p chain 


gi4506193 


26 472 


8.27 


9 


46 


84 


3.85 + 0.17 



Expression fold is designated as the ratio of CpG-ODN treated over control 

a) . NCBI accession number 

b) Swiss-Prot accession number 



Table 6. List of proteins modulated by 16 and 25 b CpG-ODN treatment 



Protein name Swiss-Prot M r p/ Match- Coyer- Score Expression fold 

no. (theor.) (theor.) ed no. age% 16n 



Cyclophilin A P05092 17 870 7.82 

78 kDa glucose regular- Pi 1021 • 72 288 5.07 

ed protein (grp78) 
HSP27 P04792 22 768 5.98 

Heat shock cognate P1 11 42 70 854 5.37 

70 kDa protein 

(hsc70) 
Macrophage capping 

protein 

Pbospboglycerate P00558 
kinase 

Pyruvate kinase P14618 57 710 7.95 

40s ribosomal protein P08865 32 833 4.79 
SA (RSP40) 



P40121 38 494 5.88 



44 284 7052 



5 
13 



12 
16 



11 



17 

5 



35 62 2.53 ±0.02 2.48 ± 0.15 

30 148 0.61 ± 0.04 0.29 ± 0.06 

59 124 1.00 ±0.01 2.61 ±0.12 

34 114 0.64 ± 0.01 0.31 ±0.03 



30 58 1.31 ±0.01 2.58 ±0.03 

33 71 2.32 ±0.11 4.23 ±0.19 

32 114 1.65± 0.07 2.7O±0.12 

23 61 0^33 ± 0.02 0.35 ±0.05 



Tabic 7. List of truncated proteins detected in THP-1 cells after 25 h CpG-ODN treatment 



Protein name 


Swiss-Prot 




Pt 


Matched 


Cover- 


Score 


Expression 




no. 


{obs.J 


tobs.) 


no. 


age% 




fold 


94 kDa glucose-regulated 


P14625 


■> -59 700 


-5.00 


12 


14 


104 


New 3 ' 


protein (grp94) 
















Truncated form of grp78 


P11021 


-25 000 


-5.30 


12 


22 


75 


New 


Truncated form of hsc70 


P11142 


-22 000 


-5.80 


12 


20 


96 


New 


Truncated form of hsc70 


P11142 


-19000 


-6.10 


11 


18 


113 


New 



a) New designated proteins detected in the CpG-ODN treated gel but not in the corresponding control gel 



3.4 Comparison of microarray and proteomic results 

Table 8 shows the expression of six genes and their corre- 
sponding proteins that were modulated by 8 h treatment of 
THP-I cells with CpG-ODN, Besides enoyl-coenzyme A 
hydratase, there was poor correlation between the expression 



of genes and their corresponding proteins (Table 8), sug- 
gesting that more in-depth studies were needed. To further 
evaluate whether changes observed in protein expression 
correlated with changes in mRNA levels, we randomly chose 
two proteins (PGK and PYK) that were induced after 16h 
CpG-ODN treatment 3nd determined their mRNA levels by 
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Figure 4. 2-D gel elcctrophorelic analysis of CpG-ODN-treated 
THP-1 cells. (A) Total cell protein from unstimulated THP-1 cells 
was subjected to 2-DE. (B) THP-1 cells were treated with or with- 
out 1.5 m m CpG-ODN for defined times. Cellular proteins were 
extracted and separated by 2-DE. Several up-regulated proteins 
are shown in the SYPRO Ruby staining gel. Comparison of CpG- 
ODN treated THP-1 cells to untreated cells showed that these 
proteins changed in intensity by over 80%. the experiment was 
repeated three times with similar results. 



(A) 



grp7B 




33kDa 



Rgure 5. 2-D gel electrophoretic analysis of CpG ODN-treated 
THP-1 cells. THP-1 cells were treated with or without 1.5 m m CpG- 
ODN lor defined limes. Cellular proteins were extracted and 
separated by 2-DE. Protein spots were detected by SYPRO Ruby 
staining. |A> Expression of HSP27 was induced by increasing the 
period of CpG-ODN stimulation. A truncated form of hsc70 was 
detected on the gel. IB) The native form of grp78 was detected in 
untreated cells, while the truncated form of grp78 was observed 
after 25 h stimulation. The experiment was repeated three times 
with similar results. 



Proteomrcs2005, 5, 894-906 

Table & Comparison of gene and protein expression levels in 
THP-1 cells after 8 h CpG-ODN treatment 



Protein; name 


Gene 


Protein 




expression 


expression 




fold from 


fold from 




microarray 


2-D gel 


Enoyl-Coenzyme A hydrolase 


1.72 ± 0.31 


2.52 ± 0.05 


Eukaryotic translation 


0.95 ± 0.01 


3.41 ±0.21 


elongation factor 




HSP60 


0.94 ± 0.07 


2.78 ± 0.03 


HSP90-beta 


1£8±0.11 


2.36+0.10 


Proteasome a chain * 


0.98 ± 0.03 


2.52 ±0.U 


Proteasome fJ chain 


1.09 ±0.11 


3.85 ± 0.17 



RT-PCR. Our results showed that mRNA levels of PYK 
increased after 16 h CpG-ODN treatment, while mRNA 
levels of PGK were, dramatically increased after 24h stimu- 
lation (Fig. 6). In addition, we also performed enzyme activ- 
ity analysis and found that the activity of PGK and PYK were 




2.5 



(B) 

□ GpC-ODN 
E3 CpG-ODN 




4 8 
Time (h) 

Figure 6. Activities of PYK and PGK induced by CpG-ODN. THP1 
cells were stimulated with medium alone, 1.5 m m GpCODN {as 
the negative control) or CpG-ODN for the indicated times. Cell 
lysates were extracted and assayed for (A) PYK and (B) PGK ac- 
tivities. Data represent mean -± SEM. tn= 3). 
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Figure 7. Induction of PYK and PGK transcripts by CpG-ODN. 
THP1 cells were stimulated with medium alone, 1.5 pM GpC-ODN 
or CpG-ODN for the indicated times. RT-PCR was then performed 
to analyze gene expression, fractin was used as an internal con- 
trol. The experiment was repeated three times with similar 
results. 



indeed increased by a factor —2.5 after CpG-ODN stimula- 
tion (Fig. 7). To confirm that mRNA induced by CpG-ODN 
would also be accompanied by an increase in protein 
expression even though it was not detected in 2-D gel analyT 
sis, we used more sensitive and specific Western blotting 
analysis. As shown in Table 3 and Fig. 2, ARF-3 was identi- 
fied in the microarray gene profile but not in the proteomic 
expression profile. Nevertheless, we observed enhanced pro- 
tein expression of ARF-3 after CpG-ODN stimulation by 
Western blotting. Moreover, to investigate whether the up- 
regulation of ARF-3 by CpG-ODN is mediated through the 
TLR9 pathway, 1XR9-deficient HEK293 cells were transiently 
cotransfected with hTLR9 and luciferase-reporter gene driv- 
en by a NF-KB-dependent promoter. Our data showed that 
NF-kB activity of untransfected HEK293 cells were not 
responsive to CpG-ODN stimulation, while in transfected 
HEK293 cells expressing hTLR9, NF-kB ludferase activity 
was up-regulated 12-fold after 8h CpG-ODN stimulation. 
The activation of NF-kB induced by CpG-ODN was blocked 
by. pretreatment of the transfected cells with an ARF-3 in- 
hibitor, such as brefeldin A {Fig. 8), suggesting CpG-ODN 
induces ARF-3 and activates NF-kB after the interaction of 
CpG-ODN with TLR9. 



4 Discussion 

In this study both microarray and proteomic approaches 
were used to evaluate the effect of CpG-ODN on gene/pro- 
tein expression profiles of THP-1 cells at several time points. 
Comparison of the gene expression profiles showed that 
stimulation of the cells with CpG-ODN for different periods 
of time resulted in different profiles (Tables 1-4). The differ- 
ences in mRNA expression between the cells with short and 
long stimulation could be attributable to the low reproduci- 
bility. However, to avoid experimental variations, we not only 
used the same batch of miaoarrays from the same manu- 
facturer but also appbed the samples of short and long term 
stimulation at the same time. Jn this way, we found that the 



303 



changes in expression fold of mRNA after CpG-ODN treat- 
ment were quite reproducible as shown by their mean 
± SEM (Table 4). A more likely explanation for the difference 
in the expression level of mRNA after different periods of 
stimulation with CpG-ODN is that the transient increase or 
decrease in these mRNA by CpG-ODN plays a significant 
role in modulating biological functions. For example, we 
found that the IL-18 receptor accessory protein from THP-1 
cells was up-regulated after 2h of CpG-ODN stimulation. 
The IL-18/IL18R system is known to activate Thl-mediated 
immune responses that play a critical role in host defense 
against infection [26]. Together with IL-18/IL18 R, several 
genes for antimicrobial defense were also increased, includ- 
ing thioredoxin, Pro-Pol-dUTPase j>oryprotein and Spl40. 
After 8h of CpG-ODN stimulation, however, none of these 
genes was activated any more (Table 4). Since sustained or 
excessive production of these antimicrobial molecules might 
lead to inflammation and cellular damage [27], a plausible 
explanation is that THP-1 cells fight against the invasion of 
pathogens by up-regubting antimicrobial defense-associated 
genes at an early stage of stimulation and then shut them 
down to avoid over-activation. Whether this explanation is 
true remains to be verified. 

It is noteworthy that our data also identified the up-reg- 
ulation of several anti-inflammatory associated genes after 
8h of CpG-ODN stimulation. These genes included FPRL1, 
IL-10 receptor, vitamin D receptor, and LxR (Table 3). FPR 
and FPRL1 have been defined as chemotactic factors 
involved in host defense against bacterial infection and in the 
clearance of damaged cells. Additional studies have indicated 
that FRPL1 interacts with a menagerie of structurally diverse 
pro- and antiinflammatory ligands associated with diseases, 
including amyloidosis, Alzheimer's diseases, prion disease 
and HIV [28, 29). Therefore, FPRL1 may play an important 
role in regulating and/or balancing the production of pro- 
arid an ti- inflammatory molecules in CpG stimulated THP-1 
cells. Additionally, a recent study has demonstrated that LxRs 
and their ligands act as negative regulators of macrophage 
inflammatory gene expression and inhibit the expression of 
inflammatory mediators such as inducible nitric oxide syn- 
thase, cyclooxygenase and IL-6 in response to bacterial 
infection or LPS stimulation [30]. Of interest, we found that a 
transcri prion factor gene connexin 59, a regulator of IL-6 
expression, was up-regulated after 2 h of CpG-ODN stimula- 
tion. It is thus likely that CpG-ODN stimulation of THP-1 
cells for 2h may induce the expression of the pro-inflamma- 
tory cytokine IL-6 through the up-regulation of the connexin 
59 gene, while 8h of CpG-ODN treatment may counter- 
balance the initial inflammatory response by inducing LxR to 
inhibit JL-6 production. 

Signal transduction molecules play an important role in 
cellular activation. Intracellular signal transduction systems 
employing various intermolecular interactions through 
docking elements, including SH2 and SH3 domains, have 
been reported (31-33]. Here we found that THP-1 ceils 
treated with CpG-ODN for 2 h up-regulated gene expression 
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of Grb2-Iilce protein (which contains an SH3 domain), while 
8h of stimulation induced Nek, Ash and phospholipase C 
binding protein (NAP4 which contains an SH2 domain). It is 
thus possible that Grb2-lilce protein and NAP4 may play 
important roles in CpG-ODN mediated signaling pathways. 
Furthermore, recent studies have also revealed that binding 
of CpG-DNA to TLR9 results in activation of INK [34). Since 
JNK is activated by Nek adaptor protein and Nek interacting 
kinase [35, 36j/it is possible that CpG-ODN may activate JNK 
via up-regulation of NAP4. Although a recent publication 
described the gene expression profiles of a cultured mouse 
macrophage cell line after CpG-DNA stimulation [34], their 
microarray results were only conducted at one time-point 
(6h stimulation). Moreover, they did not report the meas- 
urement of protein expression profiles in response to CpG- 
ODN stimulation. 

Comparison of the gene and protein expression profiles 
showed that there was discordance between mRNA and pro- 
tein levels (Table 8). Similar discordance between the 
expression pattern of genes and proteins was also reported in 
other system using different stimuli [37-41). The dis- 
cordance between mRNA and protein levels could be due to 
screening capability such as detection sensitivity, choice of 
cut-off point, quantitarivity of microarray and 2-D gels, as 
well as time discrepancy between gene and protein expres- 
sion [39, 40, 42, 43]. Alternatively, it could also be explained 
by post-trancriptional events, such as alternative splicing or 
PTM [39, 40, 42, 43]. Another possible explanation is that 
most of the spots observed in the 2-D gels are isoforms of 
some proteins. The intensity of each spots does not neces- 
sary represent total amount of a certain protein and thus 
does not correlate with its mRNA level. Our finding that 
microarray results correlated better with Western blotting 
results (eg., ARF-3 in Fig. 2), an approach more suitable than 
2-D gels for determining the total amount rather than iso- 
forms of a given protein, seems to suggest that formation of 
isoforms should be carefully taken into consideration when 
one tries to correlate mRNA and protein expression data. 

Using a proteornics approach, we found that CpG-ODN 
treatment up-regulated the expression of many proteins 
including HSPs, metabolic enzymes, structural proteins, as 
well as macrophage capping protein, cydophilin and protea- 
some a arid 0 chain etc, HSPs are the most abundant and 
ubiquitous soluble intracellular proteins. They are up-regu- 
lated by various stressors including temperature, glucose 
deprivation, microbial infection and. cancer [44]. They func- 
tion as molecular chaperones to prevent protein aggregation 
and contribute to the folding of nascent and altered proteins. 
In addition, they are able to regulate immune responses, 
including production of inflammatory cytokines and che- 
mokines and activation or maturation of immune cells [45, 
46], Beside HSPs, cydophilin as well as proteasome a and [3 
chain have also been reported to be involved in the immune 
response |39, 47]; proteasome p chain is consistently up- 
regulated in human neutrophils following LPS exposure [39]. 
Our finding that the protein levels of HSPs, cydophilin, and 

© 2005 WILEY- VCH Verlag GmbH & Co. KGaA, Weinheim 



Proteornics 2005, 5, 894-906 

proteasome a and P chain were increased after CpG-ODN 
treatment suggests that these molecules might play a role in 
the immunostimulating effect of CpG-ODN. To what extent 
these proteins contribute to the immune responses of the 
cells to CpG-ODN is currently under study. Proteomic anal- 
ysis also showed that truncated forms of grp78, grp94 and 
hsc70 were induced, a phenomenon similar to calreticulm 
observed by Richards and his coworkers [48J. The expression 
of full length hsc70 and grp78 were decreased while the 
levels of their truncated derivatives was increased after CpG- 
ODN treatment. These -results suggest that the degradation 
of these proteins has been enhanced. We also found that 
proteosome a and p chains as well as ubiquitin specific pro- 
tease 12 were increased by CpG-ODN. Whether these 
enzymes or other enzymes were responsible for the genera- 
tion of truncated hsc70 and grp78 remains to be eluddatecl 
Cells rely on multiple signaling pathways to determine 
their fates of survival, proliferation or apoptosis [49]. In fact, 
apoptosis plays an important role in relating pathogen 
infection. To be able to grow and replicate in the target cells, 
pathogens may have to bloclc apoptosis. Results from several 
laboratories have made it dear that HSP70 and HSP27 pro- 
tect cells not only from heat, but also from most apoptotic 
stimuli [48, 50] by binding to Alct and subsequently mediat- 
ing anti-apoptotic activity through activation of Akt [51-53]. 
Since our data revealed that CpG-ODN induced the expres- 
sion of HSP90 and HSP27, it is possible that CpG-ODN 
might prevent apoptosis by up-regulation of HSPs. 

Interestingly, our microarray data also showed that CpG- 
ODN mediated the induction of a set of genes associated 
with tumor progression and cell proliferation. Among these, 
one gene, WISP-2, was up-regulated by CpG-ODN after both 
2 and 8h stimulation. WISP genes were first identified as 
downstream targets of the Wnt-10-catenin signaling path- 
way. They belong to the CCN family of growth factors that 
have been receiving increasing attention lately due to some 
of the family members having been reported to be involved 
in angiogenesis and tumorigenesis [54]. It would be inter- 
esting to evaluate whether CpG-ODN plays a role in angio- 
genesis and tumorigenesis by regulating WISP-2. In addi- 
tion, we found that some genes associated with neurode- 
generation or neuroprotection, such as FPRU, NMDAR 
{NM DA) receptor, PKC and dishevelled 3 were up-regulated 
To our knowledge, this is the first report to suggest an asso- 
riation between these genes and CpG-ODN stimulation. As 
mentioned above, FPRLl plays a crurial role in proin- 
flammatory aspects of systemic amyloid sis and neurodegen- 
erative disease such as Alzheimer's disease and prion disease 
[28]. NMDAR, PKC and disheveled are involved in modulat- 
ing amyloid precursor protein metabolism, which is central 
to the pathogenesis of Alzheimer's disease [55-57]. Most 
notably, recent studies have shown that the TLR 4 -dependent 
pathway is involved in neurodegenerarion of the central 
nervous system [58]. Whether CpC-ODN moieties of patho- 
gens play any role in neurodegenerative diseases such as 
Alzheimer's remains to be elucidated. 
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Figure 8. ARF-3 participates in the CpG- ODN-TLR9-NF-K B path- 
way. HEK293 cells were cotransfected with p5xNF-KB and human 
TLR9. After overnight transfection, the cells were incubated with 
or without 1.5 mm CpG-ODN for 8h in the presence or absence of 
increasing concentrations of the ARF-3 inhibitor brefeldin A. 
After incubation, cells were lysed and NF-xB luciferase activity 
was measured. Data represent mean ± SEM. [n = 3). 



Exposure of cells to LPS or microbial infection has been 
known to induce several genes encoding metabolic enzyme 
(34, 39]. Our microarray data also revealed that a large number 
of genes encoding proteins involved in energy synthesis and 
fatty acid oxidation, such as en oyl- coenzyme A hydratase, 
pTopionyl coenzyme A carboxylase and cytochrome p450 . 
were activated by CpG-ODN treatment. In addition, we found 
that other proteins such as ARF-3 were up-regulated (Table 3, 
Fig. 8). ARFs are 20kDa GTPases of theras superfamily that 
are critical to vesicular trafficking, including exocytic protein 
transport and endocytosis [59, 60]. This study demonstrates 
for thefirst time that ARF-3 is involved in theactivabon of N F- 
kB induced by CpG-ODN (as shown in Fig. 8) 

CpG-DNA/ODN has been shown to elicit primarily 
responses via the TLR9/MyD88 dependent pathway (18, 21, 
22]. Chromosome location analysis showed that instead of 
localizing on one or two chromosome, the genes/proteins 
modulated by CpG-ODN stimulation aie scattered on all 
chromosomes except chromosomes 23 and 24. These results 
seem to suggest that CpG-ODN either affects multiple chro- 
mosomes simultaneously or subsequently via cascades of 
cellular messengers. More studies are needed to elucidate its 
mechanism of actions. 



5 Concluding remarks 

In summary, by using microarray and proteomic approaches 
to evaluate the effect of CpG-ODN at different time points, 
we have found that genes/pioteins regulated by CpG-ODN 
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are related to inllarrrmatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduction, 
tumor progression, cell differentiation, proteolysis, anti-apop- 
tosis as well as neurodegeneration and neuroprotection- Our 
results may help delineate the CpG-ODN mediated pathway 
and contribute to further understanding of mechanisms that 
link innate immunity with acquired immune responsejs). 
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Quantification of CK20 gene and protein expression in colorectal cancer 
by RT-PCR and immunobistochemistry reveals inter- and iatratumour 
heterogeneity. 

Lassmann S , BauerM, Soong R, Schregunann J . Tahiti K Nahm, T Rucer R 
Hotter H. Werner M . ' 

Pathologisches Institut, Universitatsklinikum Freiburg, Albettstrasse 19, 79104 Freiburg 
Germany, lassmann@ukl.uni-freiburg.de ' 

Cytokeratin 20 (CK20) is an epithelial protein expressed almost exclusively in the 
gastrointestinal (GI) tract and is widely used as immunohistochemical marker for routine 
diagnosis. In conlrast, CK20 gene expression is not an established marker for the 
classification of rumours and the detection of disseminated cancer cells in colorectal 
cancer. Recently, real-time reverse transcriptase polymerase chain reaction (RT-PCR) has 
provided the means for reproducible and quantitative investigation of molecular markers 
This report directly compares CK20 mRNA and protein expression in serial sections of 
archival, formalin-fixed, paraffin-embedded (FFPE) colorectal adenocarcinomas. CK20 
expression was detected by immunohistochemistry (IHC) in 60/63 (95.2%) cases by 
conventional RT-PCR in 58/60 (96.7%) and by quantitative RT-PCR using the 
LightCycler (LightCycler is a trademark of a Member of the Roche Group) System in 
29/32 (90.6%) microdissected cases, one case yielding variable results. Despite the high 
detection rate of all three techniques, marked heterogeneity of CK20 expression was seen 
between different cases and also within individual cases. CK20 expression profiles were 
not related to particular histopathological features of the tumours. A good correlation (r = 
0.8964) was found between CK20 mRNA and protein expression by comparing 
quantitative RT-PCR with IHC in 32 cases. This was also true for selected heterogeneous 
tumour cells within individual cases. Both RT-PCR and IHC are therefore valuable tools 
for CK20 detection in colorectal adenocarcinoma, with real-time RT-PCR providing 
supplementary quantitative information. This suggests a promising supportive role for 
quantitative RT-PCR in molecular pathology. Copyright 2002 John Wiley & Sons, Ltd. 

Publication Types: 

• Evaluation Studies 

PMID: 12237879 [PubMed - indexed for MEDLINE] 
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Expression of the multidrug resistance-associated protein (MRP) mRNA 
and protein in normal peripheral blood and bone marrow haemopoietic 
cells. 

Lcgrand O, Pcrrot JY, Tang R . Simonin G . GurbuxaniS. Zittoun R . Marie JP . 

Laboratoire de Cinetique et de Cultures Cellulaires, Hotel Dieu, Paris, France. 

We studied the expression of multidrug resistance-associated protein (MRP) in normal 
haemopoietic celts from peripheral blood and bone marrow. The MRP mRNA levels 
were estimated by RT/PCR and in situ hybridization (ISH) assay, and the protein levels 
by flow cytometry- 21 samples of peripheral blood and 21 samples of bone marrow (1 1 
normal bone marrow donors, 1 0 patients in complete remission after chemotherapy for 
large cell lymphoma or acute myeloid leukaemia) were analysed. In peripheral blood the 
mean MRP mRNA level in CD34- cells was statistically higher than in the other cells (3- 
fold by the methods used). The levels of MRP in CD3+ varied from one individual to 
another (4.5-34.8 units by RT/PCR and 5-23 grains/cell by ISH); however, this was 
proportional to the variation in all the cell lineages of same individual (r = 0.84). In bone 
marrow the mean MRP levels of the various cell lineages (including CD34+) were 
similar to the basal level in HL60 cells. Individual expression levels were again variable; 
however, diere was no difference between untreated normal bone marrow and post 
chemotherapy normal bone marrow. MRP protein expression was determined by flow 
cytometry with the monoclonal antibody MRPm6. The CD4+ lymphocytes exhibited a 
higher MRP protein expression than die other cell lineages, including CD8+ cells. There 
was a good correlation between the three mediods used (RT/PCR and ISH, P = 0.000 1, r 
= 0.87; RT/PCR and flow cytometry, P = 0.0001, r = 0.85; ISH and flow cytometry P = 
0.002, r = 0.67). 

PMDD: 8757504 [PubMed - indexed for MEDLINE] 
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Vascular endothelial growth factor enhances cardiac allograft 
arteriosclerosis. 

Lemstrom KB, Krebs R, Nykaaen AI . TUckaaen JM . Sihvola RK Aaltoia EM . 
UMIXM, WoodJ, Aljtalo.K, Yla-Herttuala S, Koskincn PIC 

Cardiopulmonary Research Group, Transplantation Laboratory, University of Helsinki 
and Helsinki University Central Hospital, Helsinki, Finland. KarLLemstrom@helsinki.fi 

BACKGROUND: Cardiac allograft arteriosclerosis is a complex process of alloimmune 
response, chronic inflammation, and smooth muscle cell proliferation that includes cross 
talk between cytokines and growth factors. METHODS AND RESULTS: Our results in 
rat cardiac allografts established alloimmune response as an alternative stimulus capable 
of inducing vascular endothelial growth factor (VEGF) mRNA and protein expression in 
cardiomyocytes arid graft-infiltrating mononuclear inflammatory cells, which suggests 
that these cells may function as a source of VEGF to the cells of coronary arteries. Linear 
regression analysis of these allografts with different stages of arteriosclerotic lesions 
revealed a strong correlation between intragraft VEGF protein expression and the 
development of intimal thickening, whereas blockade of signaling downstream of VEGF 
receptor significantly reduced arteriosclerotic lesions. In addition, in cholesterol-fed 
rabbits, intracoronary perfusion of cardiac allografts with a clinical-grade adenoviral 
vector that encoded mouse VEGF(164) enhanced the formation of arteriosclerotic 
lesions, possibly secondary to increased intragraft influx of macrophages and 
neovascularization in the intimal lesions. CONCLUSIONS: Our findings suggest a 
positive regulatory role between VEGF and .coronary arteriosclerotic lesion formation in 
the allograft cytokine microenvironment 
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CHAPTER 29 



Regulation of transcription 



The phenotypic diJTerences that distinguish the 
various kinds of celJs in a higher eukaryote are 
largely due lo differences in the expression of 
genes that code for proteins, that is, those tran- 
scribed by RNA polymerase JJ. hi principle, the 
expression of these genes might be regulated at 
any one of several stages. The concept of the 
"level of control" implies that gene expression 
is not necessarily an automatic process once it 
has begun. Jt coidd be regulated in a gene- 
specific way at any one of several sequential 
steps. We can distinguish (at least) five poten- 
tial control points, forming the series: 

Activation of gene structure 
i 

Initiation of transcription 
i 

Processing the transcript 
i 

Transport lo cytoplasm 
I 

Translation of in RNA 

The existence of the first step is implied by 
the discovery thai genes may exist in either of 
too structural conditions. Relative to the stale 
of most of the genome, genes are found in 
an "active" stale in the cells in which they 
are expressed (see Chapter 27); The change of 
stmcmre is distinct from the act of transcrip- 
tion, and indicates that the gene is "iranscrib- 
able." This suggests that acquisition of the 
"active'* structure must be the first step in senc 
expression. 

Transcription of a gimc in I he active Mare is 



controlled at the stage of initiation, tfiat is, by 
the interaction of RNA polymerase with its pro- 
moter. This is now becoming susceptible to 
analysis in the in vitro systems (see Chapter 
28). For most genes, this is a major control 
point; probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukary- 
otic cells, for example, via an tiler ruination 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by polyadenyla- 
tion at the 3' end; Introns must be spliced out 
from the transcripts of interrupted genes. The 
manire RNA must be exported from die nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at die level of nuclear 
RNA might involve any or all of these stages, 
but the one for which we have most evidence 
concerns changes in splicing; some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type of pro- 
tein product (see Chapter 30). 

Finally, the translation of an m.RNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of this mechanism in 
adult somatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed to involve the block- 
ing of initiation of translation of some mKNAs bv 
specific protein factors. 

But having acknowledged that control of gene 
expression can occur at multiple stages, and 
that production of RNA cannot inevitably be 
equaled with production of protein, if is dear 



|PAGF. 3/5 ' RCVD AT 5/?5f?006 17:36:38 PM [Pacific Dayliyhl Time) ' SVR:SVCS01/0 ' DNIS:66Q& * CStD:613 95? 9303 ■ DURATION (mm ss):0? ?8 



MAY 25 2006 15:34 FR CI: 



IC1ST 



613 952 9303 TO J03246606 



P. 04 



848 I Chapter 29 



that the overwhelming majority of regulatory 
events occur at the initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation; what identifies 
the common target genes to the transcription 
factor, and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 



regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter element that is recognized 
by a regulatory transcription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
. USE (lieat shock response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in Table 29.1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the faclor 
extends for a short distance on either side of 



. Table 29.1 Inducible transcription factors, bind to 
response elements that identify groups of promoters 
or enhancers subject to coordinate control.. ; .. 



Regulatory Agent Module Consensus 



Factor 



Heat shock USE CNNGAANNTCCMNG HSTF 

Glucocorticoid GRE TGGTACAAA7 GTTCT Receptor 

Phofbol ester "TRE TGACTCA API 

Serum SRE CCATATTAGG SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoint, but are usually <200 bp upstream 
of it The presence of a single element usually 
is sufficient to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in pro- 
moters or in enhancers. Some types of elements 
are typically found in one rather than the other: 
usually an USE is found in a promoter, while a 
GRE is found in an enhancer. We assume that 
all response elements function by the same 
general principle. A gene is regulated by a 
sequence at the promoter or enhancer that is 
recognized by a specific protein. 7?ie protein 
functions as a transcription factor needed for 
UNA polymerase to initiate. Active protein is 
available only under conditions when the gene is 
to be expressed; its absence means that the pro- 
moter is not activated by this particular circuit 

An example of a situation in -which man)' 
genes are controlled by a single faclor is pro- 
vided by the heat shock response. This is com- 
mon to a wide range of . prokaryotes and 
eukaryotes and involves multiple controls of 
gene expression: an increase in temperature 
turns off transcription of some genes, turns on 
transcription of the heat shock genes, and 
causes changes in the translation of mRNAs. 
The control of the heat shock genes illustrates 
the differences between • prokaryolic and 
eukaryotic modes of control. In bocteria. a new 
sigma faclor is synthesized thai directs hW 
polymerase hoi o enzyme to recognize an nltcr- 
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Enhanced expressions of arachidouic acid-sensitive tandem-pore domain 
potassium channels in rat experimental acute cerebral ischemia. 

LLZB, ^laagJUK, LiJLL, WangXL. 

Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union 
Medical College, Beijing 100050, China. 

To further explore the pathophysiological significance of arachidonic acid-sensitive 
potass.um channels, RT-PCR and Western blot analysis were used to investigate the 
expression changes of TREK channels in cortex and hippocampus in rat experimental 
acute cerebral ischemia in this study. Results showed that TREK-1 and TRAAK rnRNA 

a T° rte H J^l 1 T d TREK - 2 mRNA in W PP ocam P^ «*owed significant increases 2 h 
after middle cerebral artery occlusion (MCAO). While the mRNA expression levels of 
the all three channel subtypes increased significantly 24 h after MCAO in cortex and 
hippocampus At the same time, the protein expressions of all the three channel proteins 
^ed a&uficaot increase 24 h after MCAO in cortex and hippocampus, but only 
1 REK-1 showed increased expression 2 h after MCAO in cortex and hippocampus 
taummohis tochemical experiments verified that all the three channel proteins had higher 
expression levels in cortical and hippocampal neurons 24 h after MCAO These results 
?S f 8 Sti , me . COrreIation between T REK channels and acute cerebral ischemia 
i KfcR channels might provide a neuroprotective mechanism in the pathological process. 

PMBD: 15652517 [PubMed - indexed for MEDLINE] 
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Retinal preconditioning and the induction of heat-shock protein 27. 
LiY, Roth S . Laser M . Ma JX . Crosson CE . 

c^S^mt 00 ^ MediCal University ° f South CaroIina ' CharIeston ' South 

D^Snv ^ ri f Peri ° dS - °f iS< ? em j a have been show " to protect the retina from 
potenua y damaging periods of ischemia. This phenomenon has been termed ischemic 
precond, honmg or ischemic tolerance. In the present study the cellular cSng^Sb 
of heat shock protein (Hsp)27, -70, and -90 mRNA and expression of Hsp in the^atS ina 
associated with ischemic preconditioning were evaluated. METHODS' UMte£^ 
ischem,a was created in Long-Evans and Sprague-Dawley rats for 5 minutes ^ S w re 
then left for 1 hour to 7 days, to allow the retina to reperfuse. Retinas were tad he 
mRNA and protein isolated, and Northern and Western blot analyses 
changes in expression of Hs P 27, -70, and -90. Immunohistochemical studies wte used to 
.denufy retinal re gl ons where Hsp changes occurred. Selected animals were suited to 
a second ischenuc event, 60 nunutes in duration, to correlate the changes in expression of 
Hsp with functional protection of the retina from ischemic injury. RESULTS*? 
and sham-treated annals retinal Hsp27, -70, and -90 mRNAswere detectable Five 
hours after retmal preconditioning, levels of Hsp27 mRNA were elevated above control 
levels, and 24 hours later, mRNA levels increased 200% over basal levels Hs P 27 

level? H S n77 maii ; ed f ° r UP t0 72 h ° UrS and then retura * control 

S Hsp27 P.™ tein ,evels were ltlcre ^d by 200% over basal levels 24 hours after 

Ss lnoon^r ne ' ? ^ 72 h ° Un - ^ retumed «° control 

ZT a a I n ° consistent cha »g e «> HSP70 or -90 mRNA or protein levels was 
observed during the course of the study. Immunohistochemical studies demons^ that 
he mcreaS e m expression of Hsp27 was localized to neuronal and non-neuronal ceUs n 
theinner layers of the retina. Electroretinography studies demonstrated a strong 
correlation between the protection of retinal function from ischemic injury and the 

"** 2? : INCLUSIONS: These results provide evidence thaUhe induction 
of Hsp27 is a gene-specfic event associated with ischemic preconditioning in the retina 
Tin mcrease in expression of Hsp27 occurs in both neuronal and non-neuLal retin 7 
ceHs, and appears to be one component of the neuroprotective events induced by 
ischemic preconditioning iri the retina. '- / y 

PMID: 12601062 [PubMed - indexed for MEDLINE] 
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Increasing expression of tissue plasminogen activator and plasminogen 
activator inhibitor type 2 in dog gingival tissues with progressive 
inflammation. 

Lindberg P, Kinnby B, Lecander I Lang NP . Matsson L . 

Center for Oral Health Sciences, Malmo University, S-214 21 Malmo, Sweden 
pia.luidberg@od.mah.se " 

Urokinase and tissue-typeplasminogen activators (u-P A and t-PA) are serine proteases 
tha convert plasm.nogen mto plasmin, which degrades matrix proteins and activates 
meta loproteinases. The PAs are balanced by specific inhibitors (PAI-1 and PAl-2) 
Local production of t-PA and PAI-2 was recently demonstrated in human gingival 
tissues. The aim now was to investigate the production and localization of t-PA and PAI- 
-2 in gmgwal tissues from dogs in three well-defined periodontal conditions; clinically 
healthy gingiva, chronic gingivitis and an initial stage of ligature-induced loss of 
attachment. At the start of the experiment the gingiva showed clear signs of 
inflammation. Clinically healthy gingi va were obtained after 21 days period of intense 
oral hygiene^ Attachment loss was induced by placing rubber ligatures around the neck of 
some teeth. Biopsies were taken from areas representing the different conditions and 
prepared for in situ hybridization and immunohistochemistry. In clinically healthy 
gingiva bom t-PA mRNA and antigen were expressed in a thin outer layer of the sulcular 
and Junctional epitheha. No t~PA signals or staining were seen in connective tissue. Both 
mlWA signaling and immunostaining for t-PA were stronger in chronic gingivitis. In 
areas wUh loss of attaclrrnent, t-PA mRNA as well as antigen were found in the sulcular 
and junctoonal epitheha to a similar degree as in gingivitis. Occasionally the connective 
tissue was involved especially in connection with vessels. PAI-2 mRNA was seen in a 
thin outer layer of the sulcular and junctional epithelia in clinically healthy gingiva but 
no signals were seen in connective tissue. PAI-2 antigen was found primarily in the 
outer layer of the sulcular and junctional epithelia.Some cells in the connective tissue 
were stained. In gingivitis, PAI-2 signals were mainly found in the same locations but 
f e ™™?* xt ^ awards the connective tissue. Immunostaining was seen in 
the outer half of the sulcular and junctional epithelia as well as in the upper part of the 
connective t.ssue, close to the sulcular epithelium. In sites with loss of attachment, PAI- 
2 mRNA. was found throughout the sulcular and junctional epithelia, as was the antigen 

; y „ N ° PAf ~ 2 mRNA W3S Seen in COIUiective tis ™> "he was 
found scattered, especally near vessels. This study shows that the expression of both t~ 

PA and PAI-2 increases with experimental gingival inflammation in the dog and 

furthermore, the two techniques demonstrate a strong correlation between the 

topographical distribution of the site of protein synthesis and the tissue location of the 

^hgens for both t-PA and PAI-2. The distribution correlates well with previous 

findings in humans. . 
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Effect of duration of fixation on quantitative reverse transcription 
polymerase cliaiu reaction analyses. 

Macabco-OnrlVT, Guuingcr PG, PekkerN. McMillan A . Rcgezi JA Wong DT 
Jordan RC . ' ' 

Oral Pathology, Department of Stomatology, University of California San Francisco 
California 94143-0424, USA. 

Increasingly, there is the need to analyze gene expression in tumor tissues and correlate 
these findings with clinical outcome. Because there are few tissue banks containing 
enough frozen material suitable for large-scale genetic analyses, methods to isolate and 
quantify messenger RNA (mRNA) from formalin-fixed, paraffin-embedded tissue 
sections are needed. Recovery of RNA from routinely processed biopsies and 
quantification by the polymerase chain reaction (PCR) has been reported; however the 
effects of formalin fixation have not been well studied. We used a proteinase K-salt 
preemption RNA isolation protocol followed by TaqMan quantitative PCR to compare 
the effect of formalin fixation for 24, 48, and 72 hours and for 1 week in normal (2) oral 
epithelial dysplasia (3), and oral squamous cell carcinoma (4) specimens yielding 9 fresh 
and 36 formalin-fixed samples. We also compared mRNA arid protein expression levels 
using lmmunohistochemistry for epidermal growth factor receptor (EGFR) matrix 
metalloproteinase (MMP)-l, P 21, and vascular endothelial growth factor (VEGF) in 15 
randomly selected and routinely processed oral carcinomas. We were able to extract 
RNA suitable for quantitative reverse transcription (RT) from all fresh (9/9) and 
formalin-fixed (36/36) specimens fixed for differing lengths of time and from all (15/15) 
randomly selected oral squamous cell carcinoma. We found that prolonged formalin 
fixation (>48 h) had a detrimental effect on quantitative RT polymerase chain reaction 
results that was most marked for MMP-1 and VEGF but less evident for p2 1 and EGFR 
Comparisons of quantitative RT polymerase chain reaction and immunohistochemistry ' 
showed that for all markers, except p21, there was good correlation between mRNA and 
protein levels. p21 mRNA was overexpressed in only one case, but protein levels were 
elevated in all but one tumor, consistent with die established translational regulation of 
p21 These results show that RNA can be reliably isolated from formalin-fixed, paraffin- 
embedded tissue sections and can produce reliable quantitative RT-PCR data However 
results for some markers are adversely affected by prolonged formalin fixation times. 

PMID: 12218216 [PubMed - indexed for MEDLINE] 
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Id-1 and Id-2 arc overcxprcsscd iu pancreatic cancer aud iu dysplastic 
lesions in chronic pancreatitis. 

Maruvatna H , KlceffJ, Wildi S , Fricss H . Buchler IVfW . Israel MA . KorcM . 
Division of Endocrinology, Department of Medicine, University of California, Irvine, 

Id proteins antagonize basic helix-loop-helix proteins, inhibit differentiation, and enhance 
cell proliferation. In this study we compared the expression of Id-1, Id-2, and Id-3 in the 
normal pancreas, in pancreatic cancer, and in chronic pancreatitis (CP). Northern blot 
analyse demonstrated that all three Id mRNA species were expressed at high levels in 
pancreatic cancer samples by comparison with normal or CP samples. Pancreatic cancer 
cell lines frequently coexpressed all three Ids, exhibiting a good correlation between Id 
mRNA and protein levels, as determined by immunoblotting with highly specific anti-Id 
anybodies. Immunohistochemistry using these antibodies demonstrated the presence of 
taint Id-I and Id-2 immunostaining in pancreatic ductal cells in the normal pancreas 
whereas Id-3 immunoreactivity ranged from weak to strong. In the cancer tissues, many 
of the cancer cells exhibited abundant Id-1 , Id-2, and Id-3 immunoreactivity. Scoring on 
he basis of percentage of positive cells and intensity of immunostaining indicated that 
Id-1 and Id-2 were increased significantly in the cancer cells by comparison with the 
respective controls. Mild to moderate Id immunoreactivity was also seen in the ductal 
cells in the CP-like areas adjacent to these cells and in the ductal cells of small and 
interlobular ducts in CP. In contrast, in dysplastic and atypical papillary ducts in CP Id-1 
and Id-2 immunoreachvity was as significantly elevated as in the cancer cells These 
findings suggest that increased Id expression may be associated with enhanced 
proliferative potential of pancreatic cancer cells and of proliferating or dysplastic ductal 
cells in. CP. 



PMID: 10487839 [PubMed - indexed for MEDLINE] 
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ld-1 and ld-2 Are Overexpressed in Pancreatic 
Cancer and in Dysplastic Lesions in 
Chronic Pancreatitis 



Haruhisa Maruyama,* Jorg Kleeff,* Stefan Wildi,* . 
Helmut Friess* Markus W. Buchler,* 
Mark A. Israel * and Murray Korc* 

From the Division of Endocrinology, Diabetes, and Metabolism*. 
Departments of Medicine, Biological Chemistry and 
Pharmacology, University of California, Irvine, California; the 
Department of Viscera! and Transplantation Sttrgety, 1 University 
of Bern, Bern, Switzerland; and the Preuss laboratory* 
Department of Neurological Surgery, University of California, 
San Francisco, California 



Id proteins antagonize basic helix-Joop-belix pro- 
teins, inhibit differentiation, and enhance cell prolif- 
eration. In this study we compared the expression of 
Id-3, Jd-2, and ld-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern blot analysis demonstrated that all three Id 
mRNA species Tvere expressed at high Jevels in pan- 
creatic cancer samples by comparison with normal or 
CP samples. Pancreatic cancer cell lines frequently 
coexpressed all three Ids, exhibiting a good correla- 
tion between Id mRNA and protein levels, as deter- 
mined by immunoblotting with highly specific anti-Jd 
antibodies, lmmunohistochemistry using these anti- 
bodies demonstrated the presence of faint ld-3 and 
ld-2 inxmunostaining in pancreatic ductal cells in the 
normal pancreas, whereas ld-3 immunoreactivity 
ranged from weak to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant ld-1, 
ld-2, and ld-3 immunoreactivity. Scoring on the basis 
of percentage of positive ceils and intensity of immu- 
nostaining indicated that Id-land ld-2 were increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild to moderate Id immuno- 
reactivity was also seen in the ductal cells in the 
CP-like areas adjacent to these cells and in the ductal 
cells of small and interlobular ducts in CP: In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
ld-1 and ld-2 immunoreactivity was as significantly 
elevated as in the cancer cells. These findings suggest 
that increased Id expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of proliferating or dysplastic ductal cells in 
CP. (Am J Pathol 199% J 55:8 1 5-822) 



Basic helix-loop-helix (bHLH) proteins play an important 
role as transcription factors in cellular development, pro- 
liferation, and differentiation; 1,2 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1,2 Ho 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroD, and numerous other proteins, are 
known to activate tissue-specific genes. 3 ; 5 These tissue- 
specific bHLHs typically form heterodimers with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A,.E2-2, HEB, and other genes (also termed 
E-proteins). 6 " 9 These heterodimers activate transcription 
of genes that are associated with differentiation. 

Id genes encode a family of four HLH proteins that lack 
the basic DNA binding domain. 1 10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodimers with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
differentiation-associated genes. 1012 Id gene expres- 
sion is down-regulated on differentiation in many cell 
types in vitro and in vivo. 13-18 In addition, Id proteins seem 
to be required for cell cycle progression through G/3 
phase in certain cell types, and interaction between ld-2 
and pRB is associated with enhanced proliferation in 
some cell lines in vitro. 19-23 

Pancreatic cancer is the fifth leading cause of cancer 
death in. the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the over expression of a variety of mito- 
genic growth factors and their receptors, and by onco- 
genic mutations of K-ras and inactivation of the p53 tumor 
suppressor gene. 25 We have recently reported that pan- 
creatic cancers overexpress the HLH protein ld-2, and 
that enhanced expression of this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether the expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP), an inflammatory, disease that is characterized by 
-dysplastic ducts, foci of proliferating ductal cells, acinar 
cell degeneration, and fibrosis. 27 We now report that 
there is a five- to sixfold increase in ld-1 and ld-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 



Patients and Methods 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range 
14-68 years), CP tissues from 13 males and 1. female 
(median age 42.1 years; range 30-56 years), and pan- 
creatic cancer tissues from 10 male and 6 female donors 
(median age 62.6 years; range 53-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
duodenopancreatectomy (Whipple/pylorus-preserving 
Whipple; n = 13), a left resection of the pancreas (n = 2), 
or a total pancreatectomy (n — 1) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Union Internationale Contre le Cancer 
• (UICC) 6 tumors were stage 1, 1 was stage 2, and 9 were 
stage 3 ductal cell adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition, tissue samples were frozen in liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. All studies were ap- 
proved by the Ethics (Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine, California. 

Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
. viously. 26 28 Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform 
method. RNA was size-fractionated on 1.2% agarose/1.8 
mol/L formaldehyde gels, electrotransferred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and. hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979-bp human ld-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of ld-1, ld-2, and ld-3, respectively. A 
BamYW 190-bp fragment of mouse 7S cDNA that. hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS films and the resulting 
autoradiography were scanned to quantify the intensity of 
the radiographic bands. 26 28 For each sample the ratio of 
Id mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression of the re- 
spective Id mRNA species in the cancer and CP sam- 
ples, the same normal samples were used for normal/ 
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Figure 1. mRNA expression of Jd-1, Jd-2, and ld-3 in pancreatic cancer and 
chronic pancreatitis. Total RNA (20 up/lane) from six normal, eight cancer- 
ous, and seven chronic pancreatitis tissue samples were^ subjected to North- 
ern blot analysis using 32 P-labeled cDNA probes (500.000 cpm/mJ) specific 
for ld-1, ld-2, and Jd-3 r respectively. A 7S cDNA probe (50,000 cpm/ml) was 
used as a loading and transfer control. Exposure limes of the normal/cancer 
blots were 1 day for all Id probes, and 2 days for the normal/ CP blots. 
Exposure time was 4 hours for mouse 7S cDNA. By comparison with the 
normal samples, id -1 and ld-3 mRNA levels were elevated in 8 and 9 cancer 
samples, respectively,, whereas Jd-2 was elevated in 6 cancer samples. 

cancer and normal/CP membranes. The median score for 
ld-1, ld-2, and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed with 
SigmaStat software (Jandel Scientific, San Raphael, CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level of significance. 

Cell Culture and Western Blot Analysis 

PANC-1, MIA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cell lines were obtained from ATCC (Manas- 
sas, VA). COLO-357 human pancreatic cells were a gift 
from Dr. R. S. Metzger (Durham, NC). Cells were routinely 
grown in DMEM (COLO-357, MIA-PaCa-2. PANC-1) or 
RPMI (ASPC-1. CAPAN-1) supplemenled with 10% fetal 
bovine serum, 100 U/ml penicillin, and 100 ng/rnl strep- 
tomycin. For immunoblot analysis, exponentially growing 



Id Proteins in Pancreatic Diseases 817 
AJP September 1999, Vol. 755, AV>. 3 



•r-H 

> 
O 



1600 
1400 
1200 
1000 
800 
600 
400 





D 


ld-1 


D 

□ 








,D rP 

I 


o. 


OQ 








1 2 kb-> 



14kDa-> 




1.6kb-l 
1.2kb-» 



18 kDa ^ 




1*1 ll 



mRNA 




Normal CP Cancer 

Figure 2. Derwitometric analysis of Northern Wots. Autoradiographs of 
Northern blots from 12 normal. 1 4 CP, and )6 pancreatic cancers were 
analyzed by densitometry. mRNA levels were determined by calculating ihe 
ratio of ihe optical density for the respective Jd mRNA species in rebtion to 
. the optical density of mouse 75 cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in die cancer and CP samples, 
the same normal samples were used for normal/cancer and normal/ CP 
membranes. Normal pancreatic tissues are indicated by circles, CP tissues by 
triangles, and cancer tissues by squares. l>aia are expressed as median 
scores ± SI). By comparison with the normal samples, only the cancer 
samples exhibited significant increases: 6.>fokl (P< 0.01) for ld-1, fivefold 
<P < 0.01.) for Id-2, and twofold (P = 0.027) for Id- 3. * 



cells (60-70% confluent) were solubilized in lysis buffer 
containing 50 mmol/L Tris-HCl, pH 7.4, 150 mrnol/L NaCI, 
1 mmol/L EDTA, 1 ^g/ml pepstatin A, 1 mmol/L phenyl- 
methylsultonyl fluoride (PMSF), and 1% Triton X-100. Pro- 
teins were subjected to sodium dodecyl sulfate polyacryl- 
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Figure 3. Id mRNA and protein expression in pancreatic cancer cell lines. 
Upper panels:. Total RNA (20 ^g/lane) from 5 pancreatic cancer cell lines 
vere subjected to Northern blot analysis using 32 P-labeled cDNA probes 
(500,000 cpm/ml) specific for ld-1, )d-2, and ld-3 t respectively. Exposure 
times were 1 day for all Id probes. Lower panels: Jmrnunoblouing. Cell 
lysates (30 fig/lane) were subjected to SDS-PAGE. Membranes were probed 
with specific Jd-1, Id- 2, and ld-3 antibodies. Visualization was performed by 
enhanced chemiluminescence. 



amide gel electrophoresis (SDS-PAGE), transferred to 
Immobilon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit IgG. Visualization 
was performed by enhanced chemiluminescence. 



immunohistochemisiry 

Specific rabbit anti-human (C-20), ld-2 <C-20), and 
ld-3 (C-20; all from Santa Cruz Biotechnology, Santa 
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Figure 4. Normal and cancerous pancreatic tissues were subjected to immu- 
nostaining using highly .specific ami-Id-] (A-C), anti-Id- 2 (D-F). and ami- ld-3 
(G-0 antibodies as described in ihe Methods section. Moderate to strong Id- 1 
immunoreactiviry was present in the cytoplasm of duct-like cancer cells (A 
and C, left panel). In the normal pancreas tliere was weak ld-1 immunore- 
aciivity in the ductal cells (B). Preabsorption with the ld-1 -specific blocking 
peptide abolished the Id- J immunoreactiviry (C, right panel). Strong Id-2 
immunoreactiviry was observed in the cytoplasm of the cancer cells that 
exhibited duct-like structures (D and F, left panel), whereas in the normal 
pancreas, there was only weak Id- 2 immunoreactiviry in the ductal cells (E). 
Preabsorption with the Id- 2-specific blocking peptide alxdished the ld-2 
immunoreaciivity (F, right panel). Moderate to strong kl-3 immunoreaciivity 
was present in the duct- like cancer cells (G and I. left panel). Moderate to 
strong ld-3 immunoreactiviry was al.so present in the ductal cells of normal 
pancreatic tissue samples (H). ld-3 immunoreaciivity was completely abol- 
ished by preabsorption with the ld-3 specific blocking peptide (t, right 
paneD- A, D, and G constitute serial sections of a pancreatic cancer sample. 
' revealing coex press ion of the three Id proteins. Scale bars. 25 pm. 

Cruz, CA) polyclonal antibodies were used for irnrnunhis- 
tochemistry. These affinity-purified rabbit polyclonal anti- 
bodies specifically react with ld-1, ld-2, and ld-3, respec- 
tively, of human origin, as determined by Western 
blotting. Paraffin-embedded sections {A /xrrt) were sub- 
jected to immunostaining using Ihe streptavidin-peroxi- 
dase technique. Where indicated, immunostaining tor all 
three Id proteins was performed on serial sections. En- 



dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubated for 15 minutes (23°C) 
with 10% normal goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 
taining 1 % bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and streptavidin-peroxidase complex, 
using diaminobenzidine tetrahydrochloride as the sub- 
strate. Sections were counterstained with Mayer's hema- 
toxylin. Preabsorption with ld-1-, ld-2-, or ld-3-specific 
blocking peptides completely abolished immunoreaciiv- 
ity of the respective primary antibody. The immunohisto- 
chemical results were semiquantitatively analyzed as de- 
scribed previously. 29 : 30 The percentage of positive 
cancer cells was stratified into four groups: 0, no cancer 
cells exhibiting immunoreactivity; 1, <33% of the cancer 
cells exhibiting immunoreactivity; 2, 33 to 67% of the 
cancer ceils exhibiting immunoreactivity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal was also stratified into 
four groups: 0, no immunoreactivity; 1 , weak immunore- 
activity; 2, moderate immunoreactivity; 3, strong immu- 
noreactivity. Finally, the sum of the results of the cell 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 12 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of the 1.2-kb ld-1 transcript and the 
1.6-kb Id2 mRNA transcript in 11 of the 12. normal pan- 
creatic samples, and the 1.3-kb ld-3 mRNA transcript in 
all normal pancreatic samples (Figure 1A, 2). In the can- 
cer tissues, ld-1 mRNA levels were elevated in 8 of 16 
samples, ld-2 mRNA levels were elevated in 9 of these 
samples, and ld-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A. 2). Concomitant over expres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
normal controls (Figure 1B, 2). Densitometry analysis of 
all of the autoradiograms indicated that there was a 6.5- 
fold increase (P < 0.01) in ld-1 mRNA levels, a fivefold 
increase (P < 0.01) in ld-2 mRNA levels, and a twofold 
increase (P = 0.027) in ld-3 mRNA levels in the pancre- 
atic cancer samples in comparison to normal controls 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in the expression levels of ld-1 , ld-2, and 
ld-3. in CP tissues in comparison to the corresponding 
levels in the normal pancreas (Figure 2). 

Next, we assessed the expression of the three Id 
genes in 5 human pancreatic cancer cell lines by North- 
ern and Western blot analyses, ld-1 mRNA was present 
at varying levels in alt 5 cell lines (Figure 3). ASPC-1, 
CAPAN-1, MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels of ld-1 mRNA, whereas COLO- 357 cells 
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Figuie 5. lmmunohisiochemistry of pancreatic earner and dysplasiic ducts in CP tissues. Jn the pancreatic cancer tissues (A-C) there was moderate to strong Jd-1 
(A) t Jd-2 (B>, and Jd-3 (C) immunoreactivity in the ductal cells in the areas adjacent to the cancer cells that exhibited CP-like alterations. Islet cells did not exhibit 
Id immunoreactiviiy (outlined by solid arrowheads). Jn the CP samples, moderate to strong ld-1 (D), )d-2 (E), and Id-3 (F> immunoreaciiviiy was present in the 
cytoplasm of epithelial cells forming Jarge dysplasiic ducts. Scale bar. 25 /im. 



expressed relatively low levels of this mRNA moiety. 
Western blotting with a highly specific anti-Id- 1 antibody 
confirmed the presence of the approximately 14-kd ld-1 
protein in the 4 cell lines that expressed high levels of 
ld-1 mRNA (Figure 3). Furthermore, the three cell lines 
with the highest ld-1 mRNA expression (CAPAN-1, MIA- 
. PaCa-2, and PANC-1) also exhibited the highest ld-1 
protein expression. Variable levels of the 1.6-kb ld-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
COLO-357 arid MIA-PaCa-2 cells. Immunoblot analysis 
with a highly specific anti-ld-2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell lines with exception of PANC-1 
cells, in which the 16-kd band was relatively faint (Figure 
.3). With the exception of MIA-PaCa-2 cells, there was a 
good correlation between ld-2 mRNA and protein levels 
(Figure 3). Id-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells, 
and at low levels in PANC-1 cells, td-3 mRNA was not 
detectable in ASPC-1 and CAPAN-1 cells (Figure 3). 
Immunoblot analysis with a highly specific anti-ld-3 anti- 
body revealed an approximately 14-kd band thai was most 
abundant in MIA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a faint 
Id-3 band was seen in ASPC-1 and CAPAN-1 cells. Thus, 
with the exception of PANC-1 cells, there was a good cor- 
relation between Id-3 mRNA and protein levels. 



To determine the localization of ld-1, ld-2, and Id-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong ld-1 immunoreactivity was present in 
the cancer cells in 9 of 10 1 randomly selected cancer 
samples. An example of moderate ld-1 immunoreactivity 
is shown in Figure 4A, and of strong immunoreactivity in 
Figure 4C (left panel). In contrast, in the normal pancreas, 
faint ld-1 immunoreactivity was present only in the ductal 
cells of pancreatic ducts (Figure 4 B, arrowheads). Pre- 
absorption with the ld-1 -specific blocking peptide com- 
pletely abolished the ld-1 immunoreactivity (Figure 4C, 
right panel). The cancer cells also exhibited strong ld-2 
(Figure 4, D and F, left panel) and moderate to strong Id-3 
immunoreactivity. An example of moderate Id-3 immuno- 
reactivity is shown in Figure 4G, and of strong immuno- 
reactivity in Figure 4 1 (left panel). In contrast, only faint 
ld-2 immunoreactivity was present in the ductal cells in 
the normal pancreas (Figure 4E), whereas Id-3 immuno- 
reactivity in these cells was more variable and ranged 
from moderate to occasionally strong (Figure 4H). Islet 
cells and acinar cells were always devoid of Id immunore- 
activity. Preabsorption of the respective antibody with the 
blocking peptides specific for ld-2 (Figure 4F t right panel) 
and Id-3 (Figure 41. right panel) completely abolished im- 
munoreactivity. Analysis o( serial pancreatic cancer sec- 
tions revealed that there was often colocatization of the 
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ftgure 6. Inimunuhistochemistiy of atypical papillary epithelium in CP tissues. Serial sect ion analysis of some CP samples revealed the presence of large dud-like 
structures with atypical papillary epithelium. Mild to moderate Id- J (A) and ld-2 (B) immunoreactivity and weak ld-3 (C) immunoreaciivity was present in the 
cytoplasm of the cells forming these large ducts with papillary structures. Some CP samples also exhibited moderate ld-3 immunoreactivity in these cells (D) Sole 



bar, 25 t*m> 



three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A; D. and G. 

ld-1, ld-2, and ld-3 immunoreactivity was also present 
at moderate levels in the cytoplasm of ductal cells within 
CP-like areas adjacent to the cancer cells (Figure 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreactivity. In A of 9 CP 
samples, there were foci of ductal cell dysplasia of rela- 
tively large interlobular duels, all of which exhibited mod- 
erate to strong ld-1 , ld-2, and ld-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
large ducts exhibiting atypical papillary epithelium. Serial 
section analysis of one of those CP samples revealed 
mild to moderate ld-1 and ld-2 immunoreactivity and 
weak ld-3 immunoreaciivity in the cells of these atypical 
papillary ducts (Figure 6, A-C). In contrast, in some of 
these CP samples, moderate to strong ld-3 immunoreac- 
tivity was also observed (Figure 6D). However, most of 
the ductal cells forming the typical ductular structures of 
CP. such as large interlobular ducts and small proliferat- 
ing ducts, exhibited generally only weak to occasionally 
moderate Id immunoreactivity (data not shown). 



The immunohistochemical data for ld-1, ld-2, and ld-3 
are summarized in Table 1. In the case of ld-1 and ld-2, 
the cancer cells as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast; due to the marked variability in ld-3 immuno- 
staining in the normal pancreas, the differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitute a family of HLH transcription factors 
that are important regulators of cellular differentiation and 
proliferation. 12 To date, four members of the human Id 
family have been identified. 1 - 10-17 Their expression is 
enhanced during cellular proliferation and in response to 
mitogenic stimuli, 19,31 and overexpression of Id genes 
inhibits differentiation and/or enhances proliferation in 
several different cell types. 15 - 3? - 3 * The forced expression 
of ld-1 in mouse small intestinal epithelium results in 
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Table 1. Histological Scoring 







ld-1 


ld-2 


ld-3 


Normal {n = 6) 


Ductal cells 


2.0 ± 0.4 


2.3 ± 0.2 


2.5 ± 0.9 


Cancer (n - 10) 


Cancer cells 


4.5* ±0.5 


5.2 s ±0.3 


4.5 ± 0.6 


CP {n « 9) 


Typical CP lesions (n = 9) 


. 2.7 ± 0.5 


3.1 ±0.6 


3.4 ± 0.7 




Dysplastic ducts (n - 4) 


5.3* ± 0.2 


5.8* ± 0.2 


5.3 ± 0.4 




Atypical papillary ducts {n = 5). 


4.4* ± 0.2 


5.2* ± 0.2 


5.0 ±0.4 



Scoring ot.the histological specimens was performed as described in the Patients and Methods section. Values are the means ± SD of the number 
of samples indicated in parenthesis. P values are based on comparisons with the respective controls in the normal samples. 
\ P < 0.02; *P.< 0.01; X P = 0.004; §P = 0.001. 



adenoma formation in these animals. 36 The growth-pro- 
moting effects of Id genes are thought to occur through 
several mechanisms. For example"; ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20 - 21 and ld-1 and ld-2 
can antagonize the bHLH-rhediated activation of known 
inhibitors of cell cycle progression such as the cyclin- 
dependent kinase inhibitor P21. 23 

In the present study, we determined by Northern blot 
analysis that a significant percentage of human pancre- 
atic cancers expressed increased ld-1, ld-2. and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-f old) and ld-2 (fivefold). In contrast, ld-3 mRNA . 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence, of ld-1, ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas faint ld-1 and ld-2 immunoreactivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal cells. Pancreatic acinar and islet 
cells in the normal pancreas were devoid of ld-1, ld-2. 
and ld-3 immunoreactivity. In the cancer samples, all 
three Id proteins often colocalized in the cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the half-life of either Id mRNA or Id protein 
may be altered by comparison with the other cell lines. 
Interestingly, ld-2 immunoblotttng revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 of 5 
cell lines. In view of the fact that two possible initiation 
codons have been reported for the ld-2 gene, 36 our 
observation raises the possibility that the two ld-2-immu- 
noreactive bands may represent separate translation 
products of the ld-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosls 
pathways. Thus, these cancers often exhibit increased 
expression of anti-apoptotic proteins such as Bcl-2 3S and 
abnormal resistance to Fas-ligand-rnediated apopto- 
sis. 39 It has been shown recently that forced constitutive 
expression ol fd genes together with the expression of 
anti-apoptotic genes such as Bcl-2 or BclX L can result in 



malignant transformation of human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
anti-apoptotic genes may contribute to the malignant 
potential of pancreatic cancer cells in vivo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
normal pancreas. Immunohistochemical analysis of pan- 
creatic, cancer samples revealed colocalization of weak 
to moderate ld-1, ld-2, and ld-3 immunoreactivity in pro- 
liferating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
restricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak ld-1, ld-2, and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
ducts without dysplastic changes. In general, there was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the cells forming dysplastic 
ducts there was moderate to strong Id immunoreactivity. 
In these CP samples, Id mRNA levels were relatively 
higher than in the CP samples that were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk of 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
general population. 40 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model for pan- . 
creatic cancer, such as the adenoma-carcinoma se- 
quence of colorectal carcinoma, 41 it is generally ac- 
cepted, that K-ras and p16 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process. 37 * 41 " 43 Increased Id ex- 
pression may contribute to malignant transformation of 
cultured cell lines in vtfro 11 and has been linked to cell 
invasion in a murine mammary epithelial cell line. 44 In 
view of the current findings that ld-1, ld-2, and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic ducts in CP, these observations raise the 
possibility that elevated levels of ld-1, ld-2. and. to a 
lesser extent, ld-3 may represent relatively early markers 
of pancreatic malignant transformation and may contrib- 
ute to the pathobiology of pancreatic cancer. 
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complex in HEK293 cells. 
Mechaa TP . Puett D . Naravan P . 

SSS^ ^ M ° leCUlar **** diversity of Georgia, Athens, 

We have previously reported the construction of a constitutively active luteinizing 
lZ°c?U TV" 7 T, alent!y Hnking 3 fosed heterodimenc hormone Z the " 

^con^ Characterization of clona. linea r vea£ 

tight control of YHR expression both by dose and time of incubation with doxvcvcline 
These expertments demonstrated a good correlation between expreSevel s o f the 
receptor and basal cAMP production. Moreover, the reduction 

lb lowing doxycychne removal revealed that YHR mRNA and protein d^a V S similar 
rates, aga.n suggest a strong linkage between mRNA and prLm evds The 
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c-fo, and estrogen receptor gene expression pattern in the rat uterine 
epithelium during the estrous cycle. 

Mendoza-Rodrip ,,e7, C A . Merchant-!. a Hn,: Segura-Vald^ Mr. Moreno- 

^l^t^^^ A ~ de Mexico, Ciudad Unive^itaria, 

Different studies in ovariectomized estrogen treated animals support the idea that c fos ' 
plays a role in the proliferation of uterine epithelial cells. Howe^r, these £Z £v„e us 
to reassess the .role played by c-fos in epithelial cell types of the crikmutitenZd^ 
estrous cycle The present study was undertaken to determine the c-ZTd^oZ 

3 e ? reSSi ° n PattCm * rat uterine e P ith «ta du^nTJSS 

cycle ui whiehjiatural and cyclic changes of steroid hormones occur, and corretSese 

dST ^ ePr0l f rati ° n StatUS ° f this -""^types. Fnm,^^^ 
dunng the estrous cycle using bromodeoxyuridine incorporation to DNA. ERaSa *nd 
beta proteins were assessed by immunohistochemistry. The regulation of cSene 
evasion .n the uterus of intact animals during the estrous cyde was ^uSldSus 
both m ^hybridization and immunohistochemistry. Estradiol (E(2)) and 
ffld TLmiT S 7? T* by rad " oi ™assay. The rLlte indiS m7 
ZTt ™ fi Cpithe,ia (GE) preSented maximaI Proliferation during the 

™! ?f I ^ -f d the diest ™ s (P> da ^ However, during the proestrus (P) T only LE 

A marked immunostainmg for ERalpha was detected in both LE and GE celled uW^L 

undetectable in born epitheha dunng all stages of the cycle. The highest c-fos mRNA 
level was detected in both epithelia on the M day, followed by a sigWfic^r r e^ction 
dunng the other days of the cycle. The highest pro tein contend was^i on S 
and D days, and the mm,mal value was detected on the E day. The c-Fos broSn fevel in 

pro hferauon pattern of th.s cell type. In conclusion, the overall results indicate that c-Fos 

SGr^ 2 ^ with uterine epithelial cell proliferation of LE In tne 

found Both t Vr i??"^ W3S 0bS6rVed ' no ^fi^nt correlation was 

found. Both in LE and GE, c-fos mRNA did not stricdy correlate with its protein levels 
c-fos seems to have a postranscriptional regulation in uterine epithelial cells during me 
rats estrous cycle.. Copyright 2003 Wiley-Liss, Inc. souring the 
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(3HJMK-801 binding and the mRNA for the NMDARl subunit of the 
NMDA receptor arc differcntiaUy distributed in human and rat forebrain. 

MconiP, Mugnaini M . Buiiiicmaiiu BH . Trist DG, Covvery NG . 

Department of Pharmacology, Medical School, University of Birmingham, UK. 
meonip@noveIl5.bham.ac.uk 

The distributions of [3H]MK-801 binding and the NMDA NR1 subunit mRNA were 
studied using receptor autoradiography and in-situ hybridization in rat and human brain 
whole-hemisphere coronal sections. Receptor protein detected by radioligand 
autoradiography and the mRNA for the key subunit of the receptor presented similar 
distributions in the forebrain, with a few areas showing an imbalance between the levels 
of mRNA and receptor protein. Human frontal cortex showed a relative abundance of 
NMDARl mRNA as compared to [3H]MK-80l binding. The same area in rat brain did 
not show any difference in the two distributions. In comparison, the rat claustrum 
presented a relative excess of NMDARl mRNA which was not detected in human 
sections. Human caudate nucleus exhibited relatively high levels of (3H]MK-801 binding 
that were unmatched in rat caudate. The hippocampi of either species presented similar 
levels of [3H]MK-801 binding and NMDARl mRNA, but when the two signals were 
measured in .specific subfields of the hippocampal formation, the differential distribution 
of the two signals reflected the anatomy of hippocampal connections assuming a 
preferential dendritic distribution for MK-80 1 binding. Interestingly, rat and human 
hippocampi also showed some important species-dependent difference in the relative 
distribution of the receptor protein and mRNA. The data presented show an overall good 
correlation between the mRNA for the key subunit of the NMDA receptor and the 
functional receptor detected with radioligand binding and highlight the presence of local 
differences in their ratio. This may reflect different splicing of the mRNA for the 
NMDARl subunit in specific brain areas of rat and human. The species-dependent 
differences in the relative distribution of the mRNA for the key subunit of the NMDA 
receptor and that of a marker of functional receptors also highlights important differences 
in the NMDA function in rat and human brain. 
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Abstract 

Translation initiation is regulated in response to 
nutrient availability and mitogenic stimulation and is 
coupled with cell cycle progression and cell growth. 
Several alterations in translational control occur in 
cancer. Variant mRNA sequences can alter the 
translational efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes in 
the expression or availability of components of the 
translational machinery and in the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase in 
the overall rate of protein synthesis and translational 
activation of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles of translational control, the alterations 
encountered in cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. . 

Introduction 

The fundamental principle of molecular therapeutics in can- 
cer is to exploit the differences in gene expression between 
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. 

The power of translational regulation has been best recog- 
nized among developmental biologists; because transcription . 
does not occur in early embryogenesis in eukaryotes. For ex- 
- ample, in Xenopus, the period of transactional quiescence 
continues until the embryo reaches midbfastula transition, the 
4000-cell stage. Therefore, all necessary mRNA molecules are 
transcribed during oogenesis and stockpiled in a translations JJy 
inactive, masked form The mRNA are transtationally activated 
at appropriate times during oocyte maturation, fertilization, and 



early embryogenesis and thus, are under strict translational 
control. 

Translation has an established role in cell growth. Basi- 
cally, an increase in protein synthesis occurs as a conse- 
quence of mitogenesis. Until recently, however, little was 
known about the alterations in mRNA translation in cancer, 
and much is yet to be discovered about their role En the 
development and progression of cancer. Here we review the 
basic principles of translational control, the alterations en- 
countered in cancer, arid selected therapies targeting transla- . 
tion initiation to elucidate potential new therapeutic avenues. 

Basic Principles of Translational Control 
Mechanism of Translation Initiation 
Translation Vitiation is the main step in translational regulation. 
Translation initiation is a complex process in which the initiator 
tRNA and the 40S and 60S ribosoma! submits are recruited to 
the 5' end of a mRNA molecule and assembled by eukaryotic 
translation initiation factors into an 80S ribosome at the start 
codon of the mRNA (Rg. 1). The 5' end of eukaryotic mRNA is 
capped, i.e., contains the cap structure m 7 GpppN (7-methyf- 
guarK«ine-triprK>£pho-5'-ribonucleoside). Most translation in 
eukaryotes occurs in a cap-dependent fashion, i.e., the cap is 
specifically recognized by the elF4E, 3 which binds the 5' cap. 
The elF4F translation initiation complex is then formed by the 
assembly of elF4E, the RNA helicase elF4A, and elF4G, a 
scaffolding protein that mediates the binding of the 4 OS ribo- 
sorrial subunit to the mRNA molecule through interaction with 
the eIF3 protein present on the 40S ribosome. elF4A and eiF4B 
participate in melting the secondary structure of the 5' UTR of 
the mRNA The 43S initiation complex (40S/elF2/Met-tRNA/ 
GTP complex) scans the mRNA in a 5'-»3' direction until it 
encounters an AUG start codon. This start codon is then base- 
paired to the anticodort of initiator tRNA, forming the 48S initi- 
ation complex The initiation factors are then displaced from the 
48S complex, and the 60S ribosome joins to form the 80S 
ribosoma 

Unlike most eukaryotic translation, translation initiation of 
certain rnRNAs, such as the picomavirus RNA, is cap inde- 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require elF4EL Either the 43S complex can bind 
the initiation codon directly through interaction with the IRES in 
the 5' UTR such as in the encephaJomyocarditis virus, or it can 
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F~tg. 1. Translation initiation in eukaryoles. The 4E-BPs are hyperphos- 
phorylated to release e(F4E so thai il can interact with the 5' cap, and the 
elF4F initiation complex is assembled. The interaction of poty(A) binding 
protein with the initiation complex and circularization of the mRNA is not 
depicted « the diagram. The secondary structure of the 5' UTR is melted, 
the 4 OS ribosomal subonit is bound to e(F3, and the ternary complex 
consisting of e!F2, GTP, and the Met-tRNA are recruited to the mRNA. The 
ribosome scans the mRNA in a 5'^3' direction until an AUG start codon 
is found in the appropriate sequence context. The iniliatkm (actors are 
released, and the large ribosomal subunit is recruited- 

irriuaDy attach to the IRES and then reach the initiation codon by 
scanning or transfer, as is the case with the poxvirus (1). 

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations in the 
expression or phosphorylation status of the various factors 
involved. Key components in translationai regulation that 
may provide potential therapeutic targets follow. 

e'F4E. elF4E plays a central role in translation regulation. 
It is the least abundant of the initiation factors and is con- 
sidered the rate-limiting component for initiation of cap- 
dependent translation. elF4E may also be involved in mRNA 
splicing, mRNA 3''processing, and mRNA nucteocytoplas- 
mic transport (2). elF4E expression can be increased at the 
transcriptional level in response to serum or growth factors 
(3). elF4E overexpression may cause preferential translation 
of mRNAs containing excessive secondary structure in their 
5' UTR that are normally discriminated against by the trans- 



lationat machinery and thus are inefficiently translated (4-7). 
As examples of this, overexpression of elF4E promotes in- 
creased translation of vascular endothelial growth factor, 
fibroblast growth factor-2, and eyefin D1 (2, 8, 9). 

Another mechanism of control is the regulation of elF4E 
phosphorylation. elF4E phosphorylation is mediated by the 
rnitogen-activated protein kinase-interacting kinase 1, which 
is activated by the rnitogen-activated pathway activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 rnitogen-activated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, insulin, 
angiotensin H, sre kinase overexpression, and ras over- 
expression, lead to elF4E phosphorylation (14). The phos- 
phorylation status of elF4E is usually correlated with the 
translationai rate and growth status of the cell; however, 
elF4E phosphorylation has also been observed In response 
to some cellular stresses when translationai rates actually 
decrease (15). Thus, further study is needed to understand 
the effects of elF4E phosphorylation on eIF4E activity. 

Another mechanism of regulation is the alteration of elF4E 
availability by the binding of elF4E to the elF4E-binding pro- 
teins (4E-BP, also known as PHAS-I). 4E-BPs compete with 
elF4G for a binding site in elF4E. The binding of elF4E to the 
best characterized elF4E-blnding protein, 4E-BP1 , is regur 
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E- 
BP1 binds to elF4E, whereas 4E-BP1 hyperphosphorylation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, bepatocyte 
growth factor, nerve growth factor, insulin- like growth factors 
I and II, interleukin 3, granulocyte-macrophage colony-stim- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-BP1 and 
to decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of .nutrients or growth factors results* 
in 4E-BP1 dephosphorylation, an increase in elF4E binding, 
and a decrease in cap-dependent translation. 

p70 S6 Kinase. Phosphorylation of ribosomal 40S protein 
S6 by S6K is thought to play an important role in translationai 
regulation, S6K -/- mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive influence on ceH proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
ofir^pyrirnidine tract {5' TOP) found at the 5' UTR of ribosomal 
protein mRNAs and other mRNAs coding for components of 
the translationai machinery. Prraphorylation of S6K is regu- 
lated in part based on the availability of nutrients (1 8, 1 9) and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and insulin-like growth factor I (20). 

elF2a Phosphorylation. The binding of the initiator tRNA 
to the small ribosomal unit is mediated by translation initia- 
tion factor elF2. Phosphorylation of the a-subunit of elF2 
prevents formation of the elF2/GTP/Met-tRNA complex and 
inhibits global protein synthesis (21, 22). elF2a is phospho- 
rylated under a variety of conditions, such as viral infection, 
nutrient deprivation, heme deprivation, and apoptosis (22). 
etF2a is phosphory lated by heme-regulated «hibftor, nutrient- 
regulated protein kinase*, and the IFN-hxluced, double- 
stranded RNA-activated protein kinase (PKR; Ref. 23). 
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The mTOR Signaling Pathway. The macroltde antibiotic 
rapamycin (Siralimus; Wyeth-Ayerst Research, CollegeviJJe, 
PA) has been the subject of intensive study because it in- 
hibits signal transduction pathways involved in T-ceH activa- 
tion. The rapamycin-sens&ive component of these pathways 
is mTOR (also called FRAP or RAFT1). mTOR is the mam- 
malian homologue of the yeast TOR proteins that regulate 
progression and translation in response to nutrient available 
ity (24). mTOR is a serine-threonine kinase that modulates 
translation initiation by altering the phosphorylation status of 
4E-BP1 and S6K (Fig. 2; Ref. 25). 

4E-BP1 is phosphorylated on multiple residues. mTOR phos- 
phorylates the Thr-37 and 7h>-46 residues of 4E-BP1 m vhro 
(26); however, phosphorylation at these sites is not associated 
with a loss of elF4E binding. Phosphorylation of Thr-37 and 
• Thr-46 is required for subsequent phosphorylation at several 
COOR-terminal, serum-sensitive sites; a combination of these 
phosphorylation events appears to be needed to inhibit the 
binding of 4E-BP1 to elF4E (25). The product of fas ATM gene, 
D38/MSK1 pathway, and protein kinase O also play a rote in 
4E-BP1 phosphorylation (27-29). 

S6K and 4E-BP1 are also regulated, in part, by PI3K and its 
downstream protein kinase Akt. PTEN is a phosphatase that 
negatively regulates PI3K signaling. PTEN null cells have 
constitutivery active of Akt, with increased S6K activity and 
S6 phosphorylation (30). S6K activity is inhibited both by 
PI3K inhibitors wortmannin and LY294002 and by mTOR 
inhibitor rapamycin (24). Akt phosphorylates Ser-2448 in 
mTOR in vitro, and this site is phosphorylated upon Akt 
activation in vivo (31-33). Thus, mTOR is regulated by the 
PI3K/AW pathway; however, this does not appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR autophosphorylation is blocked by wort- 
mannin but not by rapamycin (34). This seeming inconsistency 
suggests that mTOR-responsive regulation of 4E-BP1 and S6K 
activity occurs through a mechanism other than intrinsic mTOR 
kinase activity.An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activity is by the inhibition of a phospha- 
tase. Treatment with calycufin A, an inhibitor of phosphatases 1 
and 2A, reduces rapamyrin-induced dephosphorylation of 4E- 
BP1 and S6K by rapamycin (35). PP2A interacts with fuft-length 
S6K but not with a S6K mutant that is resistant to .dephospho- 
rylation resulting from rapamycin. mTOR phosphorylates PP2A 
m vitro-, however, how this process alters PP2A activity is not 
known. These results are consistent with the model that phos- 
. phorylation of a phosphatase by mTOR prevents dephospho- 
rylation of 4E-8P1 and S6K, and conversely, that nutrient dep- 
rivation and rapamycin block inhibition of the phosphatase by 
mTOR 

Polyadenylation. The poty(A) taH in eukaryotic mRNA is 
important in enhancing translation initiation and mRNA sta- 
bility. Poryadenylation plays a key role in regulating gene 
expression during oogenesis and early embryogenesis. 
Some mRNA that are translationalry inactive in the oocyte are 
polyadenylated concomitantly with translations! activation in 
oocyte maturation, whereas other mRNAs that are transla- 
tionaWy active during oogenesis are deadenylated and trans- 
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Cap-dependent Translation 

(mRNAs with highly structured 5' UTRs) 

Ftg. 2. Regulation of translation initiation by signal transduction path- 
ways Signaling via p38, extracellular signal-related kinase, PI3K, and 
mTOR can all activate translation initiationi 



lationally silenced (36-38). Thus, control of pory(A) tail syn- 
thesis is an important regulatory step in gene expression. 
The 5' cap and pory(A} tail are thought to (unction synergis- 
ticalty to regulate mRNA translational efficiency (39, 40). 

RNA Packaging. Most RNA-binding proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translational fate of the transcript (41). A highly 
conserved family of Y-box proteins is found in cytoplasmic 
messenger ribonucleoproteln particles, where the proteins 
are thought to play a role in restricting the recruitment of 
mRNA to the translational machinery (41-43). The major 
mRNA-associated protein* YB-1, destabilizes the interaction 
of elF4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results in translational repression in vivo (44). Thus, 
alterations in RNA packaging can also play an important role 
in translational regulation. 

Translation Alterations Encountered in Cancer 
Three main alterations at the translational level occur in cancer 
variations in mRNA sequences that increase or decrease trans- 
lational efficiency, changes in the expression or availability of 
components of the translational machinery, and activation of 
translation through aberrantly activated signal transduction 
pathways. The first alteration affects the translation of an indi- 
vidual mRNA that may play a rote in carcinogenesis. The sec- 
ond and third alterations can lead to more global changes, such 
as an increase in the overall rate of protein synthesis, and the 
translational activation of several mRNA species. 

Variations in mRNA Sequence 

Variations in mRNA sequence affect the translalional effi- 
ciency of the transcript. A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especially 
in the 5' UTR, can alter its translational efficiency, as seen in 
the following examples. 
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c-myc Saito et a/, proposed that translation of full-length 
c-myc ts repressed, whereas in several Burkitt lymphomas 
that have deletions of the mRNA 5' UTFt, translation of c-myc 
is more efficient (45). More recently, it was reported that the 
5' UTR of c-myc contains an IRES, and thus c-myc transla- 
tion can be initiated by a cap-independent as well as a 
cap-dependent mechanism (46, 47). In patients with multiple 
myeloma, a C->t mutation in the c-myc IRES was identified 
(48) and found to cause an enhanced initiation of translation 
via internal ribosomal entry (49). 

BRCA1. A somatic point mutation (117 G->C) in position 
-3 with respect to the start codon of the BRCA1 gene was 
identified in a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRCA1 5 f UTR and a downstream tuciferase reporter dem- 
onstrated a decrease in the translational efficiency with the 5' 
UTR mutation. 

. CycBn-dependent Kinase inhibitor 2A. Some inherited 
melanoma kindreds have a G-»T transversion at base -34 
of cycfin-dependent kinase fohibitor-2A, which encodes a 
cycltn-dependent kinase 4/cyclin-dependent kinase 6 kinase 
inhibitor important in G t checkpoint regulation (51). This- 
mutation gives rise to a novel AUG translation initiation 
codon, creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations in splicing and alternate transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, ultimately impacting transitional efficiency. 

ATM. The ATM gene has four noncoding exons in its 5' 
UTR that undergo extensive alternative splicing (52); The 
contents of 12 different 5' UTRs that show considerable 
diversity in length and sequence have been identif ied. These 
divergent 5' leader sequences play an important role in the 
translational regulation of the ATM gene. 

mdm. In a subset of tumors, overexpression of the onco- 
protein mdm2 results in enhanced translation of the mdm2 
mRNA. Use of different promoters leads to two mdm2 tran- 
scripts that differ only in their 5' leaders (53). The longer 5' 
LfTR contains two upstream open reading frames, and this 
mRNA is loaded with ribosomes inefficiently compared with 
the short 5' UTR. 

BRCA1. In a normal mammary gland, BRCA1 mRNA is 
expressed with a shorter leader sequence (5'UTRa). whereas 
m sporadic breast cancer tissue; BRCA1 mRNA is expressed 
with a longer leader sequence (5' UTRb); the translational 
efficiency of transcripts containing 5' UTRb is 10 times lower 
than that of transcripts containing 5' UTRa (54). 

TGF-fSX TGF-03 mRNA includes a 1.1-kb 5' UTR, which 
exerts an inhibitory effect on translation. Many human breast 
cancer cell lines contain a novel TGF-03 transcript with a 5' 
UTR that is 870 nucleotides shorter and has a 7-fo!d greater 
translational efficiency than, the normal TGF-p3 mRNA (55). 

Alternate Potyadenylation Sites. Multiple poryadenyl- 
ation signals leading to the generation of several transcripts 
w.th differing 3' UTR have been described for several mRNA 
species, such as the RET protc-oncogene (56). ATM gene 
(52), tissue inhibitor of metalJoproteinases-3 (57). RHOA 



proto-oncogene (58), and calmoduiin-l (59). Although the 
effect of these alternate 3' UTRs on translation is not yet 
known, they may be important in RNA-protein interactions 
that affect translational recruitment. The role of these alter- 
ations in cancer development and progression is unknown. 



Alterations in the Components of the 
Translation Machinery 

Alterations in the components of translation machinery can 
take many forms. 

Overexpresssion of eIF4E. Overexpression of.elF4E 
causes malignant transformation in rodent cells (60) and the 
deregulation of HeLa cell growth (61). Polunovsky er a/. (62) 
found that elF4E overexpression substitutes for serum and 
Individual growth factors in preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival signaling. 

Elevated levels of elF4E mRNA have been found in a broad 
spectrum of transformed cell lines (63). elF4E levels are 
elevated in all ductal carcinoma in situ specimens and inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64. 65). Prelim- 
inary studies suggest that this overexpression is attributable 
to gene amplification (66). 

There are accumulating data suggesting that elF4E overex- 
pression can be valuable as a prognostic marker. elF4E over- 
expression was found in a retrospective study to be a marker of 
poor prognosis in stages I to ill breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer is now 
under way in a prospective trial (67). However, in a different 
study, elF4E expression was correlated with the aggressive 
behavior of non-Hodgkin's lymphomas (68). In a prospective 
analysis of patients with head and neck cancer, elevated levels 
of elF4E in histologically tumor-free surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results all suggest that'elF4E overexpression can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, the head and neck cancer data 
suggest that elF4E overexpression is a field defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations in a 
number of other initiation factors have been associated with 
cancer. Overproduction of e!F4G, similar to elF4E, leads to 
mafignant transformation in vitro (69). eiF-2or is found in 
increased levels in bronchioloarveolar carcinomas of the lung 
(3). Initiation factor elF-4A1 is overexpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunit of 
translation initiation factor 3 is amplified and overexpressed 
in breast and prostate cancer (72), and the elF3-p1 1 0 subunit 
is overexpressed in testicular seminoma (73). The role that 
overexpression of these initiation factors plays on the devel- 
opment and progression of cancer, if any, is not known. 

Overexpression of S6K. S6K is amplified and highly 
overexpressed in the MCF7 breast cancer cell fine, com- 
pared with normal mammary epithelium (74). In a study by 
Barlund er a/. (74), S6K was amplified in 59 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpression of PAP. PAP catalyzes 3' poly(A). syn- 
thesis. PAP is overexpressed in human cancer cells com- 
pared with normal and viralfy transformed cells (75). PAP 
enzymatic activity in breast tumors has been correlated with 
PAP protein levels (76) and, rn mammary tumor cytosols, was 
found to be an independent factor for predicting survival (76). 
Little is known, however, about how PAP expression or ac- 
tivity affects the translations! profile. 

Alterations in RNA-binding Proteins. Even less is known 
about alterations in RNA packaging in cancer. Increased ex- 
pression and nuclear localization of the RNA-binding protein 
YB-1 are indicators of a poor prognosis for breast cancer (77), 
non-small cell lung cancer (76), and ovarian cancer (79). How- 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases chemoresistance by en- 
hancing the transcription of a multidrug resistance gene (80). 



Activation of Signal Transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpression of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
important mutant in human cancers is the tumor suppressor 
gene PTEN, which leads to the activation of the Pt3K/Akt path- 
way. Activation of PI3K and Akt induces the oncogenic trans- . 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of 4E-BP1 (81). 
. A mutant Akt that retains kinase activity but does not phos- 
phorylate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of PI3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin-like growth factor, HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likely to contribute to the growth and.aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translation^ control. 
For example, HER2/neu mRNA is translationally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a cell type-independent manner and 
by a distinct cell type-dependent mechanism that increases , 
transitional efficiency (82). HER2/neu translation is different 
in transformed and normal cells. Thus, it is possible that 
alterations at the translational level can. in part account for 
the discrepancy between HER2/neu gene amplification de- 
tected by fluorescence in situ hybridization and protein levels 
detected by immunohistochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components ol the translation machinery and signal path- 
ways involved in the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
and Tumstatin 

Rapamycin inhibits the proliferation of lymphocytes. It was 
initially developed as an immunosuppressive drug for organ 



transplantation. Rapamycin with FKBP 12 (FK506-bindtng 
protein, M t 12,000) binds to mTOR to inhibit its function. 

Rapamycin causes a smaH but significant reduction in the 
Initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S6 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and inhibits cap-dependent 
but not cap- independent translation (17,. 86). 

The rapamyrin-sertsirh/e signal transduction pathway, acti- 
vated during malignant transformation and cancer progression, 
is now being studied as a target for cancer therapy (87). Pros- 
tate, breast, small ceil lung, glioblastoma, melanoma, and T-cefl 
leukemia are among the cancer lines most sensitive to the 
rapamycin analogue CCI-779 (Wyeth-Ayerst Research; Ref. 
87). In rhabdomycosarcoma cell fines, rapamycin is either cyto- 
static or cytocidal, depending on the p53 status of the cell; p53 
wild-type cells treated with rapamycin arrest in the G, phase 
and maintain their viability, whereas p53 mutant celts accumu- 
late in G, and undergo apoptosis (88, 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
rnedulloblastoma models, rapamycin exhibited more cytotox- 
icity in combination with dsplatin and camptothecin than as a 
single agent in two, CCI-779 delayed growth of xenografts by 
1 60% after 1 week of therapy and 240% after 2 weeks. A single 
high-dose administration caused a 37% decrease in tumor 
volume. Growth inhibition in vivo was 1.3 times greater, with . 
cisplatin in combination with CCI-779 than with cisplatin alone 

(90) . Thus, preclinical studies suggest that rapamycin ana- 
logues are useful as single agents and in combination with 
chemotherapy. 

Rapamycin analogues CCt-779 and RAD001 (NovarUs, 
Basel, Switzerland) are now in clinical trials. Because of the 
known effect of rapamycin on lymphocyte proliferation, a 
potential problem with rapamycin analogues is immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycin and CCI-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamycin analogues resolve in —24 h after therapy 

(91) . The principal toxicities of CCI-779 have included der- 
mat ©logical toxicity, myelosuppression, infection, mucositis, 
diarrhea, reversible elevations in liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCI-779 have been conducted in 
advanced renal cell carcinoma and in stage lll/IV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94, 95). 
Thus, CCI-779 has documented preliminary clinical activity in 
a previously treated, unselected patient population. 

Active investigation is under way into patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 in PTEN-null tumors (30. 96). Another 
study found that- six of eight breast cancer cell lines were 
responsive to CCI-779, although only two of these lines 
lacked PTEN (97) There was,' however, a positive correlation 
between Akt activation and CCI-779 sensitivity (97). This 
correlation suggests that activation of the PI3K-AW pathway. 
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regardless of whether it is attributable to a PTEN mutation or 
to overexpression of receptor tyrosine kinases, makes can- 
cer cell amenable to mTOR-directed therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycin resistance; thus, a tower 4£-BP1/elF4E ratio 
may predict rapamycin resistance (98). 

Another mode of activity for rapamycin and its analogues 
appears to be through inhibition of angiogenesis. This activ- 
ity may be both through direct inhibition of endothelial ceil 
profif eration as a result of mTOR inhibition in these ceils or by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor in tumor celjs (99, 100). 

The angipgenesis inhibitor tumstatin, another anticancer 
drug currently under study, was also found recently to inhibit 
translation in endothelial cells (101). Through a requisite in- 
teraction with integrin, tumstatin inhibits activation of the 
PI3K/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of elF4E from 4E-BP1, thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-signaling pathway. 



ment also reduces the expression of angiogenic factors (115) 
and has been proposed as a potential adjuvant therapy for head 
and neck cancers, particularly when elevated eJRE is found in 
surgical margins. Small molecule inhibitors that bind the elF4G/ 
4E-BP1-binding domain of elRE are proapoptotic (116) and 
are also being actively pursued 

Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5' UTR These mRNA would thus be at a competitive 
disadvantage in normal cells and not translate well, whereas in 
cancer cells, they would translate more efficiently. For example, 
the introduction of the 5' UTR of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex virus typc-1 thymidine 
kinase gene, allows for selective translation of herpes simplex 
virus type-1 thymidine kinase gene in breast cancer cell lines 
compared with normal mammary cell lines and results in se- 
lective sensitivity to ganciclovir (117). 



Targeting elF2a: EPA, Clotrimazole, mda-7, 
and Flavonoids 

EPA is an n-3 polyunsaturated fatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as in animal models (104, 105). It blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2 * from 
intracellular stores while inhibiting their refilling, thereby ac- 
tivating PKR. PKR, in turn phosphorylates and inhibits elF2a\ 
resulting in the inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent in vitro and in vivo, inhibits cell growth through 
depletion of Ca 2 ^ stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cyclins A, E, and D1, 
resulting in blockage of the cell cycle in G v 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 (Ad- 
mda7) induces apoptosis in many cancer cells including 
breast, colorectal, and lung cancer (107-1*09). Ad-mda7 also 
induces and activates PKR, which leads to phosphorylation 
of elF2a and induction of apoptosis (110). 

Flavonoids such as genistein and quercetin suppress tu- 
mor cell growth. All three mammalian elF2a kinases, PKR, 
heme-regulated inhibitor, and PERK7PEK, are activated by 
flavonoids, with phosphorylation of elF2a and inhibition of 
protein synthesis (1 1 1). 

Targeting elF4A and elF4E: Antisense RNA 
and Peptides 

Antisense expression of elF4A decreases the proliferation rate 
of melanoma cells (112). Sequestration of elF4E by overexpres- 
sion of 4E-BP1 is proapoptotic and decreases tumorigenicrty 
(113, 114), Reduction of elF4E with antisense RNA decreases 
soft agar growth, increases tumor latency, and increases the 
rates of tumor doubling times (7). Antisense elF4E RNA treat- 



Toward the Future 

Translation is a crucial process in every cell. However, several 
alterations in translations! control occur in cancer. Cancer cells 
appear to need an aberrantly activated translation^ state for 
survival, thus allowing the targeting of translation initiation with 
surprisingly low toxicity. Components of the translation^ ma- 
chinery, such as elF4E, and signal transduction pathways in- 
volved in translation initiation, such mTOR, represent promising 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some preliminary activity in clinical trials. U is possible 
that with the development of better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most likely to achieve clinical utility in 
combination therapy. In the interim, our increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
in the near future. 
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Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in 
human membranous nephropathy. 

Mezzano SA, Droguett MA . Burgos ME . Ardiles LG . Aros CA , Caorsil, Egido J . 

Division of Nephrology, School of Medicine, Universidad Austral, Valdivia, Chile. 
smezzano@uach.cl 

Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in human, 
membranous nephropathy. BACKGROUND: Proteinuria plays a central role in the 
progression of glomerular disease, and there is growing evidence suggesting that it may 
determine tubular cell activation with release of chemokines and fibrogenic factors, 
leading to interstitial inflammatory reaction. However, most studies on this subject have 
been performed in experimental models, and the experience in human kidney biopsies 
has been scarce. We analyzed the tissue sections of patients with idiopathic membranous 
nephropathy (IMN), a noninflammatory glomerular disease that may follow a progressive 
disease with heavy persistent proteinuria, interstitial cell infiltration, and decline of renal 
function. METHODS: Paraffin-embedded biopsy specimens from 25 patients with IMN 
(13 progressive and 12 nonprogressive) were retrospectively studied by 
immunohistochemistry [monocyte chemoattractant protein-1 (MCP- 1 ), regulated on 
activation normal T-cell expressed and secreted chemokine (RANTES), osteopontin 
(OPN), platelet-derived growth factor-BB (PD-GF-BB)] and in situ hybridization [MCP- 
l, RANTES, PDGF-BB, transforming growth factor-betal (TGF-betal)]. Moreover, we 
studied the presence of myofibroblasts, which were identified by the expression of alpha- 
smooth muscle actin (alpha-SMA), the monocytes/macrophages (CD68-positive cells), 
and T-cell infiltration (CD4+ and CD8+ cells). All of the patients were nephrotic and * 
without treatment at time of the biopsy. RESULTS: A strong up-regulation of MCP-1 , 
RANTES, and OPN expression was observed, mainly in tubular epithelial cells, with a 
significant major intensity in the progressive IMN patients, A strong correlation between 
the mRNA expression and the corresponding protein was noted. The presence of these 
chemokines and OPN was associated with interstitial cell infiltration. TGF-beta and 
PDGF were also up-regulated, mainly in tubular epithelial cells, with a stronger 
expression in the progressive IMN, and an association with the presence of 
myofibroblasts was found. CONCLUSIONS: Patients with severe proteinuria and 
progressive IMN have an overexpression in tubular epithelial cells of the chemokines 
MCP-1, RANTES, and OPN and the profibrogenic cytokines PDGF-BB and TGF-beta. 
Because this up-regulation was associated with an interstitial accumulation of 
mononuclear cells and an increase in rnyofibroblastic activity, it is suggested that those 
mediators are potential predictors of progression in IMN. Finally, based on experimental 
data and the findings of this article, we speculate that severe proteinuria is the main factor 
responsible for the up-regulation of these factors in tubular epithelial cells. 
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Decreased uncoupling protein expression and intramyocytic ti iglyceride 
depletion in formerly obese subjects. 

^Sc^l^^^^' MancoM, VegaN, H^ssejinkMK, CastagnetoM, 

Istituto di Clirdca Medica and. Clinica Chirurgica and Centre CNR Fisiopatologia Shock. 
Universe Cattolica S Cuore, Rome, Italy, gmingrone@rm.unicatt.it & K 

?^ CTWE: T ° 6Xam . ine the muscular uncoupling protein expression 2 (UCP2) and 

g^^xpres S ,on m morbid obese subjects before and after bariatric surgery Ibilio- 
P ancreat 1 cd I ver S1 on(BPD)]. RESEARCH METHODS AKD PROCEDUReI Seven 

BPD. Skeletal muscle UCP2 and UCP3 mRNA was measured using reverse 
fransenptase-competitive polymerase chain reaction and UCP3 protein by Western 
blotting Intramyocytic triglycerides were quantified by high-performance liquid 
chromatography. Twenty-four-hour energy expenditure and respiratory quotient (RO) 
were measured m a respiratory chamber. RESULTS: After BPD, the average weigh" loss 

TowTomI 'nl N ° nPr0tein RQ WaS inCreaSCd in * e P° stobese -b el^Vs 
1 0 ?6 t ,^ n?c ' 9 ^ ^y<>^ triglyceride level dropped (3.66 

UC?2 and UCT3 tRMA^ 100 ^ ^ P * ° 0001) ^ BPD - of 
vl,i i and UCP3 mRNA was significantly reduced (from 35.9 +/- 6.1% to 18 6 +/- 4 5% 

of cydophain, p = 0.02; from 60.2 +/- 14.0% to 33.4 +/- 8.5%, p = 0.03; respectively) 

UCP3 ^otem content was also significantly reduced (272.19 +/- 84. 13 vs. 175.78 

levekVn - nVrm )- mUlt i P ! e re g ression ^™ (R(2) = 0.90) showed that IMTG 
jp * ~ 0 : 007 ) presented the most powerful independent variable for predicting 

StZ I" DISCUS ? I0N: The st ™g c ^^tion of UCP expression^ decLe 
te^aS^K* tne,yCeri ^ COntCnt P,3yS 311 ^ m ° re im P° rtant role in 
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Differential expression of the short and long forms of the gamma 2 subunit 
of the GABAA/benzodiazepine receptors. 

MiraUes CP , Gutierrez A , jQiaaZU, VitorkaJ, De Bias AL . 

Division of Molecular Biology and Biochemistry, School of Biological Sciences, 
University of Missouri-Kansas City 641 10-2499. 

The distribution of the rnRNAs encoding the gamma 2S and gamma 2L subunits of the 
GABAA receptor in the rat brain has been revealed by in situ hybridization, northern blot 
and dot blot analysis using specific antisense oligonucleotides. In addition, the 
quantitative distribution of the gamma 2S and gamma 2L subunit peptides participating in 
the fully assembled GABAA receptors/benzodiazepine receptors has been mapped by 
immunoprecipitatioh with specific anti-gamma 2S and anti-gamma 2L antibodies. 
Several neuronal types and brain regions are enriched in gamma 2L such as neurons of 
the layer II of striate cortex and cerebellar Purkinje cells as well as the inferior colliculus • 
superior colhculus, deep cerebellar nuclei, medulla and pons. Other neuronal types and 
regions are enriched in gamma 2S such as the mitral cells of the olfactory bulb, pyramidal 
neurons of the pyriform cortex, layer VI of the neocortex, granule cells of the dentate 
gyrus and pyramidal cells of the hippocampus. Other cortical areas and cerebellar granule 
cells express both gamma 2S and gamma 2L in comparable amounts: There is a good 
correlation between the relative expression of gamma 2S and gamma 2L rnRNAs and the 
relative presence of these protein subunits in folly assembled and mature receptors in the 
studied brain regions. The differential distribution of gamma 2S and gamma 2L might 
result in differential ethanol sensitivity of the neurons expressing these GABAA receptor 
subunits. 
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Follicle-stimulating hormone receptor and its messenger ribonucleic acid 
are present in the bovine cervix and can regulate cervical prostanoid 
synthesis. 

Mizrachi D , Shemesli M . 

Department of Hormone Research, Kimron Veterinary Institute, Bet Dagan, Israel 50250. 

The hypothesis that FSH regulates the bovine cervical prostaglandin E(2) (PGE(2)) 
synthesis that is known to be associated with cervical relaxation and opening at the time 
of estrus was investigated. Cervical tissue from pre-estrous/estrous, luteal, and 
postovulatory cows were examined for 1) the presence of bovine (b) FSH receptor (R) 
and its corresponding mRNA and 2) the effect of FSH on the PGE(2) regulatory pathway 
in. vitro. The presence ofbFSHR mRNA in the cervix (maximal during pre-estrus/estrus) 
was demonstrated by the expression of a reverse transcription (RT) polymerase chain 
reaction (PCR) product (384 base pairs) specific for bFSHR mRNA and sequencing. 
Northern blotting revealed three transcripts (2.5, 3.3, and 3.8 kilobases [kb]) in cervix 
from pre-estrous/estrous cows. The level of FSHR (75 kDa) was significantly higher (p < 
0.01) in Western blots of pre-estrous/estrous cervix than in other cervical tissues. There 
was a good correlation between the 75-kDa protein expression arid its corresponding 
transcript of 2.55 kb throughout the estrous cycle as described by Northern blot analysis 
as well as RT-PCR. Incubation of FSH (10 ng/ml) with pre-estrous/estrous cervix 
resulted in a 3 -fold increase in the expression of FSHR and a 2-fold increase in both G 
protein (alpha(s)) and cyclooxygenase II. FSH (5-20 ng/ml) significantly increased (p < 
0.01) cAMP, inositol phosphate (p < 0.01), and PGE(2) (p < 0.01) production by pre- 
estrous/estrous cervix but not by cervix at the other stages. We conclude that bovine 
cervix at the time of the peripheral plasma FSH peak (pre-estrus/estrus) contains high 
levels of FSHR and responds to FSH by increasing the PGE(2) production responsible for 
cervical relaxation at estrus. 
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The alpha(v)beta6 integrin receptor for Foot-and-mouth disease virus is 
expressed constitutively on the epithelial cells targeted iu cattle. 

Monaghan P , GoldS, SimesoiiJ, Zhang Z, Weinreb PH , Violettc SM, Alexandersen 
S, Jackson T . ~ — 

Institute for Animal Health, Pirbright Laboratory, Ash Road, Pirbright, Surrey GU24 
ONF, UK. 

Field strains of Foot-and-mouth disease virus (FMDV) use a number of alpha(v)- 
mtegrins as receptors to initiate infection on cultured cells, and integriris are believed to 
be the receptors used to target epithelial cells in animals. In this study, 
immunofluorescence confocal microscopy and real-time RT-PCR were used to 
investigate expression of two of the integrin receptors of FMDV, alpha(v)beta6 and 
alpha(v)beta3, within various epithelia targeted by this virus in cattle. These studies show 
that aipha(v)beta6 is expressed constitutively on the surfaces of epithelial cells at sites 
where infectious lesions occur during a natural infection, but not at sites wherelesions 
are not normally formed. Expression of alpha(v)beta6 protein at these sites showed a 
good correlation with the relative abundance of beta6 mRNA. In contrast, alpha(v)beta3 
protein was only detected at low levels on the vasculature and not on the epithelial cells 
of any of the tissues investigated. Together, these data suggest tfiat in cattle 
alpha(v)beta6, rather than alpha(v)beta3, serves as the major receptor that determines the 
tropism of FMDV for the epithelia normally targeted by this virus. 
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Urokinase-mediated posttranscriptional regulation of urokinase-recepfc 



or 



Montuori N , Mattiello A, Mancini A , Taglialatela P . Caputi M. Rossi G . Ragno P . 

Istituto di Endocrinologia ed Oncologia Sperimentale, Consiglio Nazionale delle 
Ricerche, Naples, Italy. 

The urokinase-type plasminogen activator (uPA) and its cellular receptor (uPAR) are 
involved in the proteolytic cascade required for tumor cell dissemination and metastasis, 
and are highly expressed in many human tumors. We have recently reported that uPA, 
independently of its enzymatic activity, is able to increase the expression of its own 
receptor in uPAR-transfected kidney cells at a posttranscriptional level. In fact, uPA, 
upon binding uPAR, modulates the activity and/or the level of a mRNA-stabilizing factor 
that binds the coding region of uPAR-mRNA. We now investigate the relevance of uPA- 
mediated posttranscriptional regulation of uPAR expression in non small cell lung 
carcinoma (NSCLC), in which the up-regulationof uPAR expression is a prognostic 
marker. We show that uPA is able to increase uPAR expression, both at protein and 
mRNA levels, in primary cell cultures obtained from tumor and adjacent normal lung 
tissues of patients affected by NSCLC, thus suggesting that the enzyme can exert its 
effect in lung cells. We investigated the relationship among the levels of uPA, uPAR and 
uPAR-mRNA binding protein(s) in NSCLC. Lung tissue analysis of 35 NSCLC patients 
shows an increase of both uPA and uPAR in tumor tissues, as compared to adjacent 
normal tissues, in 27 patients (77%); 1 9 of these 27 patients also show a parallel increase 
of the level and/or binding activity -of a cellular protein capable of binding the coding 
region of uPAR-mRNA. Therefore, in tumor tissues, a strong correlation is observed 
among these 3 parameters, uPA, uPAR and the level and/or the activity of a uPAR- 
mRNA binding protein. We then suggest that uP A regulates uPAR expression in NSCLC 
at a posttranscriptional level by increasing uPAR-stability through a cellular factor that 
binds the coding region of uPAR-mRNA. Copyright 2003 Wiley-Liss, Inc. 
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Vascular endothelial growth factor expression correlates with matrix 
metalloproteinases MT1-MMP, MMP-2 aud MMP-9 in human 
glioblastomas. 

Munaut C, Noel A , Hougrand (X FoidartJM. Boniver J. Deprcz M . 

Laboratory of Tumour and Development Biology, University of Liege, Liege, Belgium. 

Vascular endothelial growth factor (VEGF) is the major endothelial mitogen in central 
nervous system neoplasms and it is expressed in 64-95% of glioblastomas (GBMs). 
Tumour cells are the main source of VEGF in GBMs whereas VEGF receptors (VEGFR- 
1, its soluble form sVEGFR-1, VEGFR-2 and neuropilin-1) are expressed predominantly 
by endothelial cells. Infiltrating tumour cells and newly-formed capillaries progress 
through the extracellular matrix.by.local proteolysis involving matrix metalloproteinases 
(MMPs). Recent studies have shown that VEGF expression and bioavailability can be 
modulated by MMPs. We reported previously that the expression of MT1-MMP in 
human breast cancer cells was associated with an enhanced VEGF expression. We used 
quantitative RT-PCR, Western blot, gelatin zymography and immunohistochemistry to 
study the expression of VEGF, VEGFR-l, VEGFR-2, sVEGFR-1, neuropilin-1 MT1- 
MMP, MMP-2, MMP-9 and TIMP-2 in 20 human GBMs and 5 normal brains. The 
expression of these MMPs was markedly increased in most GBMs with excellent 
correlation between mRNA and protein levels; activated forms of MMP-2 and MMP-9 
were present in 8/18 and 7/18 of GBMs. A majority of GBMs (17/20) also expressed high 
levels of VEGF, as previously reported, with strong correlation between VEGF and MT1 - 
MMP gene expression levels, and double immunostaining showed that VEGF and MT1- 
MMP peptides co-localize in tumour and endothelial cells. Our results suggest that the 
interplay between metalloproteinases and VEGF previously described in experimental 
rumours may also be operative in human GBMs. Because of its dual ability to activate 
MMP-2 and. to up-regulate VEGF, MT1-MMP might be of central importance in the 
growth of GBMs and represent an interesting target for anti-cancer treatments. Copyright 
2003 Wiley-Liss, Inc. 
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MTI-MMP is involved in both developmental and tumour an- 
£3Hr£ MTJ-MMP overexpression in human melanoma 
cells has been associated w,th enhanced in vitro invasion and 
increased m vivo tumour growth and vascularization 28 We have 1 
shown prev,ously that in MCF-7 breast cancer cells, VEGF tran- 
scription JS upreguJated when MTI-MMP is overexpressed » Ur> 
regulation of VEGF by MTI-MMP has also been reported in a 
model of human oji oma xenograft by Deryugina et al™ These 
expenmental data suggest a link between MTI-MMP and the 
VEGF network. We have tested for the presence of such a link in 
human glioblastomas. We compared the expression of VEGF and 
its receptors (VEGFR-J, sVEGFR-1, VEGFR-2 NRP1) with 
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MT1-MMP, MMP-2, MMP-9 and 71MP-2 in a series of 20 GBMs 
and 5 normal brains. Using quantitative RT-PCR,. gelatin zymog- 
raphy, Western blot and immunohistochemistry, we showed a 
strong correlation between the expression of VEGF, MT1-MMP, 
MMP-2 and MMP-9 in GBMs. These results are in accordance 
with previous in vino studies and add to the evidence of an 
interplay between VEGF and MMPs in the progression of human 
GBMs. 



MATERIAL AND METHODS 

Patients 

We studied 20 GBMs diagnosed at the Laboratory of Neuropa- 
thology-CHU Liege between 1997 and 2001. The series included 
17 primary GBMs (lc, no previous history of lower grade diffuse 
astrocytoma) and 3 secondary GBMs (£.*., previous history of 
lower grade diffuse astrocytoma). Clinical information on these 20 
cases have been reported previously as part of a larger series. 3 1 The 
gender ratio was 1/1, and the age at time of diagnosis ranged from 
41-79 years (mean 56 years). Nonnal brain cortex and white 
matter were obtained from 5 patients with intractable epilepsy 
treated by partial temporal lobectomy. Histological examination of 
these specimens showed severe hippocampal sclerosis; frozen tis- 
sue was sampled from microscopically normal inferior temporal 
gyri Our study was approved by the Ethical Committee of the 
Faculty of Medicine of the University of Liege. 



RNA extraction and cDNA synthesis 

Total RNA was extracted from cryosections with RNeasy Mini 
Kit (QIAGEN GmbH, Hilden, Germany) according to the manu- 
facturer's protocol. Total RNA (1 jig) was reverse transcribed with 
a ThermoScript reverse transcriptase (ThermoScript RT-PCR Sys- 
tem, Invitrogen, Carlsbad, CA) and random hexamers as primers. 

Primers 

: Primers pairs used in our study are described in Table 1. Primers 
for the VEGF gene were chosen to distinguish between VEGF, 3 9, 
VEGF, ^ VEGF I45 and VEGF )21 mRNA isoforms. Intron-span- 
ning primers and probes for the TaqMan system (primers for 
VEGFR- 1 (Flt-lX sVEGFR- 1 , VEGFR-2 (KDR/Flk- 1 ) and NRP1 ) 
were designed to meet specific criteria by using Primer Express 
software (Perkin Elmer, Foster City, CA). All primers were syn- 
thesized by Eurogentec (Liege, Belgium). The 5'- and 3'-end 
nucleotides of the probe were labeled with a Teporter (FAM. = 
6-carboxy-ih)6rescein) and a quencher dye (TAMRA = 6-car- 
boxy-tetramethylrhodaroine). We conducted BLASTn (National 
Center for Biotechnology Information, Bethesda) searches against 
dbEST and the non redundant set of GenBank, EMBL, and GDBJ 
database sequences to confirm the total gene specificity of the 
nucleotide sequences chosen for the primers. The specificity of the 
amplified PCR products was confirmed either by restriction digest 
* or by sequencing. The 18S ribosomal RNA was measured using 
the Pre-Developed TaqMan Assay Reagents Endogenous control 
kit from Applied Biosystems (Foster City, CA). 



TABLE I - SEQUENCE OF PRIMERS AND TaqMan PROBES USED FOR RT-PCR STUDIES 



Gene and accession 
number 


Position 


Sequence 


Size 


Cycles 


MMP-2 FP 


1740F 


5*-AGATCTTCTTCTTCAAGGACCGGTT-3' 
5VGGCTGGTCAGTGGCTTGGGGTA-3' 


225 bp 


33 


MMP-2-RP . 


1964R 






NM_0O4530 










MMP-9-FP 


1592F 


5'-GCGGAGATTGGGAACCAGCTGTA-3' 


208 bp 


37 


MMP-9-RP 


1800R 


5'-GACGCGCCTGTGTACACCCACA-3' 






J05070 










MMP-14-FP * 


1288F 


5'-GGATACCCAATGCCCATTGGCCA-3 f 


221 bp 


32 . 


MMP-14-RP 


1508R 


5'-CCATTGGGCATCCAGAAGAGAGC-3' 






NM_004995 










T1MP1-FP 


: 78F 


5 '- CATCCTGTTGTTGCTGTGGCTG AT-3 ' 


168 bp 


33 


T1MP1-RP 


245R 


5 '- GTC ATCTTG ATCTC ATA A CGCTGG-3 ' 






Ml 2670 










T3MP-2-FP 


78F 


5^(nrGCTGGACGTTCX;ACK3AAACAA-3' 


155 bp 


30 


TTMP-2-RP 


245R 


5'-AGCCCATCTGGTACCTGTGGTTCA-3' 






NM 003255 






479 bp 


33 


VEGF FP 


1208F 


5 ' -CCTGGTGG AC ATCTTCC AGG A GT A-3 ' 
5'-CTCACCGCCTCGGCTTGTCACA-3' 


VECF-RP 


1687R 


407 bp 




AH001553 






347 bp 
275 bp 




28S rRNA-RP 


12403F 


5'-GTTCACCCACTAATAGGGAACGTGA-3' 
S^GATTCTGACiTAGAGGCGTTCAGTO' 


212 bp 


. 19 


28S rRNA-RP 


126J4R 






U 13369 










VEGFR J -FP 


2438F 


5'-TCCCTTATGATGCCAGCAAGT-3' 


79 bp 


40 


VEGFR 1-RP 


2516R 


y-CCAAAAGCCCCTCTTCCAA-3' 






VEGFR 1 Probe 


2469 


5'-CCGGGAGAG A CTTA A ACTGGGCA A ATC A- 3 ' 






AF063657 








40 


sVEGFRI-FP 


2209F 


5'-ACAATCAGAGGTGAGCACTGCAA-3' 


180 bp 


sVEGFRI-RP 


2388R 


5'-TCCGAGCCTGAAAGTTAGCAA-3' 






s VEGFR) Probe 


2257 


5'-TCCAAATTTAAAAGCACAAGGAATGATTGTACCAC-3' 






U01134 








40 


VEGFR2-FP 


79JF 


5'-CTTCGAAGCATCAGCATAAGAAACT-3' 
5'-TGGTCATCAGCCCACTGGAT-3' 


156 bp 


VEGFR2-RP 


946R 






VEGFR2 Probe 


820 ■ 


5'-AACCGAGACCTAAAAACCCAGTCTGGGAGT-3' 






AF063658 








NRPI-FP 


I831F 


5'-CACAGTGGAACAGGTGATGACTTC-3' 


1 12-bp 


40 


NRP1-RP 


1942R 


5'-AACCATATGTTGGAAACTCTGATTGT-3'. 






NRP1 Probe 


1883 


5'-CCACAGAAAAGCCCACGGTCATAGACA-3' 






XM 034725 
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JrfS JiTt,w FM M VEG ^ Cplors mRNA q^rification: scat- 
!l! ; bB On - Nonna] bjaui < bIack and GBMs (white 

spots) mRNA levels are expressed as normalized values (as described 

Z^Vt™* A U - = Units >* Each poinT^pre 

sents the mean of 3 separate experiments. , 

End point quantitative PCR for MTJ-MMP, MMP-2 MMP-9 
TJMP-2 mRNA and, VEGF mRNA isoforms 

MTJ-MMP, MMP-2, MMP-9, T1MP-2 and VEGF mRNA i so- 
forms (VEGF 189 , VEGF 165 , VEGF M5 and VEGF m ) were me£ 
suied in 10 ng aliquots of cDNA using Taq polymerase (Takara, 
Muga, Japan) and 5 pmol of each primers (Table J). The thermal 
cychng conditions included 2 min at 95°C for denaturation and 

wnZK 1 f° n 15 560 al 94 ° C ' 20 " c al 66 ° C and 20 sec at 
Tooo ( S>o C foi , VEGF is ^orms) with a final incubation 2 min at 
i ;«^2lP roducls weie ieso * Ve d on 2% Nusieve 3:1 agarose 
gels (BioWhittaker, Rockland, MD) and analyzed using a Huor-S 
Multilmager (Bio-Rad, Hercules, CA) after ethidium bromide 
stammg. Specific mRNA levels were expressed as the ratio of 
specific lranscnpts/28S transcripts. Experiments were repeated at 
least 3 limes m duplicate. 

Rea J~J!™ ^onmaiive PCRfor VEGFR-J, sVEGFR-I, VEGFR-2 
and NRP J mRNA 

i,5?£l! me ^ uantilative RT-PCR analyses for VEGFR-1 
SVECFR-,, VEGFR-2, NRP1 mRNAs and'lSS rRNA were caj: 
ned out usmg ibe ABJ PRISM 7700 Sequence Detection System 

m f S !^^ and S ° flWarC (PE A PP ,ied Biosystems). The sequences 
ol the PCR primer pairs and fluorogenic probes thai were used for 
each gene are shown in Table I. A standard curve was generaled by 

snr^n llU l i0 ° ° f pbCCma cDNA 10 cove ' ™& of 
£U,UO0-80 ng and was run in duplicate during every experiment 
For each experimental sample, the amount of target gene was 
determined from this standard curve. The relative expression level 
of the target gene was normalized against 18S rRNA to compen- 



S3 ^ 0r , vanat,0n m ^uaiity of RNA and the amount of input 
cDNA (as described by the manufacturer PE Applied Biosystems 
in User Bulletin 2). PCR was carried out with the TaqMan Uni- 
V ^x?I ^ MU < A PP Jied Biosystems) using 5 pj of diluted 
cDNA .(equivalent to 10 ng total RNA), 200 nM of the probe, and 
400 nM pnmers m a 25 pJ final reaction mixture. After a 2 min 
incubation at 50 e C to allow for UNG cleavage, AmpUTaq Gold 
was activated by an incubation for 10 min al 95°C Each of the 40 
PCR cycles consisted, of 15 sec of denaturation at 95°C and 
hybridization of probe and primers for 1 min at 60°C 

To confirm amplification specificity, the PCR products were 
also examined by subsequent 2% agarose gel electrophoresis 
Experiments were repeated at least 3 times in duplicate. 

lmmunohistochemistry for VEGF and MTJ-MMP 

Sections (4 pjn thick) were cut from formalin- fixed, paraffin 
embedded rumour tissue. They were hydrated through graded 
alcohols and incubated in H 2 0 2 (0.3% 15 min). Sections were 
autoclaved for 11 mm at 126°C in citrate buffer pH6 for antigen 
retrieval (Dako, Glostrup, Denmark). For double immunostaiiiL 
>^T?T ere 1DCubaled ' n Primary monoclonal Ab anti-MTl- 
MMP Ab-4) MOO (Oncogene Research Products, San Diego, 
CA) followed by peroxidase- conjugated En Vision (Dako) Wui- 
noreactivity was visualized with 3,3'diaminobenzidine (DAB+ 

vS^" ff^l WCrC ~* en incubaled with polyclonal Ab and-' 
VEGF :150 (Santa Cruz, Santa Cruz, CA) for 1 hf at room 
temprature, followed, by alkaline phosphatase-conjugated En- 
Vision (Dako). Immunoreactivity for VEGF was visualized with 
Fast Red chromogenic substrate (Dako). Single immunostaininfi 
was also earned out on serial sections using each primary antibody 
alone with the corresponding enzyme-chromogene combination 
Negative controls were obtained by omitting the primary antibod- 
ies. 

Gelatin zymography assay 

MMP-2 and MMP-9 activities were quantified by gelatin zy- 
mography on 2 normal brains and 18 GBMs. Ten cryosections (10 
p,M) were homogenized in buffer (0.1 M Tris-HG pH 8.1 0 4% 
Triton X-I00) and centrifuged for 20 min at 5,0(%. The Ijellets 
were discarded. 25 ng of total protein from homogenate superna- 
Unts were mixed with non reducing sample buffer (62 5 mM 
Tns-HCl, pH 6.8; 2% SDS; 10% glycerol; 0.1% bromophenol 
blue) and electrophoresed directly on 10% SDS-po)yacryl amide 
gels (SDS-PAGE) containing 0.1% gelatin (w/v)« electro- 
phoresis, gels were washed for 1 hr at room temperature in a 2% 
(v/v) Triton X-l 00 solution to remove SDS, transferred to a buffer 
(50 mM Tris-HCl, pH 7.6, containing 10 mM CaCU and incu- 
bated for 18 hr at 37°C Gels were stained for 30 min wilh 0 1% 
(w/v) Coomassie brilliant blue G250 in 45% (v/v) methanol/10% 
-(v/v) acetic acid and desiained in 10% (v/v) acetic acid/20% (v/v) 
methanol. Gels were analyzed with Quantity One software (ver- 
sion 4.2.2, Bio-Rad Laboratories, Hercules, CA) after densitomet- 
ry scanning of the gels using a Fluor-S Multimager (BioRad). 

Western blot 

1 c^x' 1 ^ pr0tdn ,evels wcre analyzed in 2 normal brains and 
15 GBMs. Brain exiracts (25 fxg) were mixed with 1/2 sample 
buffer [0.25 M Tris (pH6.8), 10% SDS (w/vX 4% sucrose (v/v), 
a ^' merca P toethano1 < v/v ) ai >d 0. 125% bromophenol blue (w/v)J 
and boiled for 5 min. They were separated on 10% SDS-PAGE 
gels and transferred io a PVDF filter (NEN, Boston, MA) After 
blocking with 5% milk (w/v), 0.1% tween 20 (w/v) in PBS for 2 
hr at room temperature, membranes were exposed to the primary 
antibody (10 ng/ml, clone I13-5B7, Ab-4, Oncogene Research 
Products, San Diego, CA) at 4°C overnight followed by incubation 
with a horseradish peroxidase-conjugated rabbii anti-mouse anti- 
body (1.3 jxo/mJ, Dako, Glostrup, Denmark). Signals were de- 
tected with an enhanced chemoluminescence (ECL) kit (NEN. 
Boston, MA). The relative intensities of the immunoreaciive bands 
were analyzed with Quantity One software (version 4.2.2, Bio-Rad 
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Figure 2- MMPs and TIMP-2 mRNA quantifica- 
tion- (a) Representative 2% agarose gels of RT-PCR 
products for MTI-MMP, MMP-2, TIMP-2 and 
MMP-9 in 2 normal brains (NB) and 10 GBMs. (b) 
Scatter plots (as described in Fig. I). Experiment was 
repeated at least 3 times in duplicate. 



Laboratories) after densitomeiric scanning of the X-ray films using 
a Fluor-S Multimager (Bio-Rad). 

Staiistics 

VEGF, VEGFRs, MMPs and TIMP-2 expression values in 
GBMs were correlated using Spearman's test. Correlation was 
considered significant for 2-tailed p-value < 0.05. Statistical anal- 
ysis was carried out using (be Prism 3.0 software (GraphPad, San 
Diego, CA). 

RESULTS 

Expression of VEGF and VEGF receptors 

VEGF mRNA was present in normal brains (295-375, arbitrary 
units; mean = 322) and in all GBM samples (217-5,1 12; mean = 
1/774) as reported previously (Fig. I). 3 ' In most GBMs, VEGF 
mRNA levels were raised 2- 15- fold above normal brain values. 
Tbe most abundant isoform in all cases was VEGF )65 , followed by 
VEGF 1?lT VEGF, B9 and VEGF J43 (data not shown). VECFR-1 
expression was found at similar levels in GBMs (89-357; mean = 
182) and normal controls (154-198; mean = 181). There was no 
correlation between VEGFR-1 and VEGF mRNA levels ip = 
0.35) in GBMs. VEGFR-2 was expressed in all GBMs (48-582; 
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mean = 210) and in 8/20 cases at least twice normal values 
(87-111- mean = 103). VEGF and VEGFR-2 expressions were 
correlated significantly ip = 0.0035) in GBMs. NRP1 expression 
varied broadly between GBMs (75-3,260; mean = 1,061) con- 
trasting with a constant baseline expression in normal controls 
(383-397; mean - 390). In tumours, NRP1 correlated with 
VEGFR-2 ip = 0.01 19) but not with VEGF {p = 0.084), nor 
VEGFR-1 {p = 0.066). sVEGFR-I was expressed at low levels 
both in normal brains (84-92; mean = 87) and GBMs (25-208; 
mean = 101). sVEGFR-1, however, was found to correlate wjth 
VEGFR-1 ip = 0.0289), VEGFR-2 (p = 0.0029), and NRP) ip - 
0.0027) but not with VEGF (p = 0.053). 

Expression of MMPs and TJMP-2 

MTI-MMP, MMP-2 and MMP-9 were expressed in both nor- 
mal brains and GBMs but at much higher levels in the latter (Fig. 
2a). MTI-MMP mRNA levels were constantly higher id GBMs 
(34-202; mean = 106) than in normal controls (3-29; mean - 
15). MMP-2 and MMP-9 mRNA levels were higher than controls 
in 1 8/20 an9 14/20 cases respectively (Fig. 2b) and correlated with 
each other ip = 0.0187). MTI-MMP mRNA levels correlated with 
MMP-2 ip = 0.0008) and MMP-9 [p = 0.005). TIMP-2 had a 
non-discriminative distribution in relation io the controls. TIMP-2 
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^ m0graphlC . a J ,a ^ ys,S of MMP ' 2 ™<* MMP-9 in tissue 
extracts either from normal brain (N) or GBMs (GB) Medium con- 
dmoned by human IfTlOSO cells was included 'as ^siri^ cont^L 
feiUons of pro-MMP-9, MMP-9, pro-MMP-2 and MMP-2 are "IS- 
caieu uy arrows. 

TABLE D "ZYMOGRAMS AND WESTERN BLOT QUANTIFICATION OF 
- MMPS IN NORMAL BRAIN AND GBMS 1 . 



in S3? 4 ~^ ,Cni W ? ^ thc antibody (clone 

1 I3-5B7) raised against the catalytic domain of MTI-MMP Protein 
extracts from MTI-MMP transfected A2058 cells (don^ >S^W ' 



TABLE JD- CORRELATION BETWEEN VEGF, VEGF RECOTORS MMK 
AND T1MP,2 PRESSED AS p VALUES * 
; FROM SPEARMAN'S TEST 



MTI-MMP 



MMP-2 



MMP-9. 



71MP-2 



VEGF 

VEGFR-1 

VEGFR-2 

NRP1 

sVEGFR-1 



0.0250 
0.0073 
<0.0001 
0.0053 
0.0313 



0.0245 
0.0710 
0.0168 
0.1334 
0.0469 



pioMMP-9 


Nl 


0.41 


N2. 


0.28 


GB1 


9.00 


GB2 


4.28 


GB3 


1.67 


GB4 


1.40 


GB5 


0.47 


GB7 


5.29 


GB8 


5.75 


GB9 


5:02 


GBI2 


5.08 


GB13 


6.86 


GB14 


5.59 


GBI5 


8.00 


GB16 


5.55 


GB17 


6.61 


GBJ8 


14.79 


GB20 


18.85 


GB21 


16.52 


GB22 


20.66 . 



Gelaiin zymogjaphy 


MMP9 


proMMP-2 


MMP-2 


0.00 


1.52 


0.00 


0.00 


0.83 


0.00 


1.25 


2.45 


035 


0.00 


2.88 


0.09 


0.00 


2.86 


0.00 


0.00 


4.33 


0.03 


0.00 


1.15 


0.00 


0.00 


U2 


0.00 


3.78 


4.63 


0.65 


0.00 


4.09 


0.00 


0.00 


4.79 


0.20 


4.58 


4.93 


•0.48 


6.75 


5.53 


0.76 


0.00 


5.13 


0.00 


no 


5.66 


0.00 


2.04 


6.00 


0.00 


4.34 


8.20 


0.40 


0.00 


2.12 


0.00 


0.00 


2.37 


0.00 


0.00 


5.11 


0.00 



. 0.0053 
<0.0001 
0.0004 
0.2457 
0.0194 



0.0094 
0.4542 
0.0153 
0.5480 
0.2563 



.Western bfot 
MTI-MMP 



0.44 

0.47 

1.58 

0.47 

0 

0 

ND 
0 

0.55 
0.45 
3.18 
1.55 
0.94 
ND 
2.50 
0 

1.97 
0 

ND 
0.86 



^TmO^t^ uni ' s * norma, brain; GB - ^ 



oi££^ la,e - wi * ^'-MMP (P = 0.0019) and MMP-2 [p = 
0.0002) bui do! with MMP-9 (p = 0.1408). 

By gelaiin zyrnography, pro-MMP-2 and pro-MMP-9 were de- 
eded ,n Ite ,8 GBMs and 2 controls examined (fig. 3, Tab e U) 

h GBMS ' ' h CiS ° f "** iD3C,iVe f0 ™ s - re Uto In in 
nonnal brains and were correlated with their respective mRNA 

MMP-2 and MMP-9 were not found in normal brain. By contrast 

mti IfLTT *7 m f (MMP " 2) and 7/18 <^ p -9) GBMs! 

GBMs and 2 normal brains (Fig. 4, Table U). They were signifi- 



canUy correlated witf, zympgram-derived activated MMP-2 levels 
{p - O.VZ26) but not with activated MMP-9 levels (t> = 0O6> 
nterestingly, MTI-MMP protein and mRNA levels were corre- 
ated sigmfica^Uy (p = 0.089), arguing for a predominantly Iran- ■ 
scnptiODal regulation in GBMs. 

Correlation between VEGF network and MMPs 

m^mITSL^ y^ GF and VEGF rece P tt>rs wcre compared to 
MTI-MMP, MMP-2, MMP-9 and TTMP-2 (Table Hi). There was 

LSS"5S?»5 0n !l a ?? belWeen VEGF ™P™*ion and MTI- 
MMP MMP-2 and MMP-9. A similar correlation was also ob- 
served between VEGFR-2 and MMPs. Interestingly TTMP-2 ex- 

P ^ SS, ?fr,^ a , S COrreIated wi * ^GF and VEGFR-2 but not with 
other VEGF receptors. 

Immunohistochemistry for VEGF and MTI-MMP 

VEGF immunoreactivity was shown in both tumour and endo- 
mehal cells, as previously reported (Fig. 5a Jb)?* By single immu- 
nostaimng. MTI-MMP was detected in glioblastoma ^ ™ a 
diffuse cytoplasmic staining (Fig. 5 c ,d) . MTI-MMP positiviry was 
also seen m endothelial cells and perivascular cells (Ro 5 c) By 
*£^^* mne > WC observed the ^localization of VEGF 
and MTI-MMP m the cytoplasm of numerous tumour cells (Fig 



D1SCUSSJON 

GBMs are highly malignant tumours with poor prognosis. They 
show major microvascular proliferation and express high levels of 
vtoh. VEGF is a strong mitogen for endothelial cells thereby 
promoting angiogenesis. Previous reports".:" have suggested that 
VEGF also stimulates tumour cell invasion, migration and survival 
in malignant epithelial cells through an autocrine loop by which 
oppression of MMPs induces VEGF secretion and leads to 
subsequent amplification of cell proliferation and protection 
against apoptosis. We and others reported previously that in human 
melanoma and breast carcinoma cells, MTI-MMP upregulates 
VEGF expressjon whereas T1MP-2 reduces it Jin** Therefore ihe 
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Figure 5 - lmmunohistocbemis- 
try (scale bar = 50 ixm). (ajb) VEGF 
positive tumour and endothelial cells 
(plain arrows) show granular red 
staining of the cytoplasm. Negative 
cells (empty arrow) are seen in their 
close vicinity, (c) MT1-MMP posi- 
tive cells show a strong brown cyto- 
plasmic staining. They include tu- 
mours cells (plain arrow) and 
endothelial cells, (empty arrow), (d) 
MT1-MMP positive tumour cells 
(plain arrow) are mixed with nega- 
tive cells (empty arrow), (ej) Double 
staining with VEGF (red) and MTI- 
MMPI (brown). Double positive tu- 
mour cells (plain arrows) contrast 
with negative or single weakly pos- 
itive cell (empty arrow). 



pericellular proteolysis mediated by MT1-MMP in GBMs couJd 
also induce an autocrine loop resulting in enhanced VEGF expres- 
sion. In rum„ VEGF could act as a paracrine factor on endothelial 
cells to stimulate angiogenesis. or possibly as an autocrine factor 
promoting glioblastoma cells survival-migration and invasion as 
demonstrated recently in the various tumour cell culture models. 

We compared the expression of VEGF and its receptors with 
MT1-MMP, MMP 2 and MMP-9 in 20 GBMS and 5 normal brains. 
The expression of these MMPs was markedly increased in most 
GBMs with excellent correlation between mRNA and protein levels. 
MT1-MMP expression has been shown previously to correlate with 
glioma aggressiveness and its transfection in different tumour cell 
lines triggers an angiogenic phenorype and promotes tumour 
growth. 20 - 2 *- 30 - 36 -?* A majority of GBMs ( 17/20) also expressed high 
levels of VEGF, as -previously reported, with a strong correlation 
between VEGF and MT1-MMP gene expression levels. Double in> 
munos turning studies showed co-expression of VEGF and MTI- 
MMP by the same tumour cells. These data suggest that the transcrip- 




tional control of VEGF by MT1-MMP could be operative not only in 
vitro but also in vivo in human GBMs. 

MT1-MMP could also promote the growth of GBMs by its 
ability to activate MMP-2 in the presence of low concentration of 
T1MP-2. 40 Pro-MMP-2 activation occurs after the formation of a 
ternary complex that contains pro-MMP-2 linked to cell surface 
MT1-MMP via a TJMP-2 bridge. In accordance with this hypoth- 
esis, we found that MMP-2 activation occurred j'd 8/18 of our 
GBMs 20 - 41 among which 7/7 tested for MT1-MMP showed high 
contents of this protease. 

Activated MMP-9 was aiso found in 7/18 of our GBMs. This is 
an interesting finding as active MMP-9 is able to mobilize VEGF 
from its ECM reservoir. 39 Therefore, MMPs could promote 
VEGF- mediated angiogenesis in GBMs by both transcriptional 
(MT1-MMP) and post translational (MMP-9) mechanisms. 

VEGF binding to VEGFR-2 triggers the proliferation and mi- 
gration of endothelial cells whereas its binding to VEGFR-I has 
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opposite effects on glioblastoma cell lines. » In our study VEGF 
mKNA levels were correlated with VEGFR-2 but not VEGFR-I, 
NRP1 and sVEGFR-1. Collectively our data suggest that GBMs 
Asplay a specific and complex pattern of VEGF receptors, trans- 
ducing VEGF signaling toward cell proliferation and migration 

In conclusion, our study adds.to the evidence for an interplay 
between metalloproteinases and VEGF in human GBMs as previ- 
ously .documented m experimental tumours. Because of its dual ability 
to activate MMP-2 and to up-regulate VEGF, MT1-MMP might be of 
central importance in the growth of human glioblastomas and repre- 
sent an interesting target for anti-cancer treatments. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0rntoftj:§, Thomas ThykjaerU, Frederic M. Waldman||, Hans Wolf**, 
and Julio E. Celistt 



Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis. The results showed 
that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This ef-. 
feet depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels. Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in relatively few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 

Aneuploidy is a common feature of most human cancers 
(1), but tittle is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyclin d1, 
ems 7, and N-myc (3^5). However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-, 11p-, 1q+,. 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome- wide tech- 
nology for detecting genomic gains and losses (CGH) with, 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa {superficial papillary). 



1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional cell carcinoma; LOH, loss ol heterozygosity; 
PA- FA BP. psoriasis- associated fatty acid- binding protein; 2D, 
two-dimensional. 
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A Chr. 1q 733 




Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome {Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. fl, expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome 
{left). The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical line iproken) indicates a ratio value of 1 (no change), and the vertical lines next to it {dotted) indicate a ratio of 
. 0.5 {left} and 2.0 {right). In chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
profile of that chromosome is shown to the right, of the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www. MDL-DK/sdata. html). The bars indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; >2-fold 
increase {black), >2-fold decrease {blue), no significant change {orange). The oar to the far right, entitled Expression shows the resulting change . 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated fplack), at least half of the genes 
down- regulated {blue), or more than half of the genes are unchanged {orange). It a gene was absent in one of the samples and present in 
another, it was regarded as more than a 2-fold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination of —1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium-guanidinium thiocyanate 
solution and stored at -80 C C. Total RNA was isolated using the 
RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A)* RNA was isolated by an oligo(dT) selection step (Oligotex 
. mRNA kit; Oiagen). 

cRNA Preparation— 1 p.g of mRNA was used as starting material. 
The Jirst and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
turer's instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription kit (Ambion). Biot in -labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Oiagen). 

Array Hybridization and Scanning— Array hybridization and scan- 
ning was moditied from a previous method (13). 10 ^g of cRNA was 
fragmented at 94 °C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI, 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 °C lor 5 min, 
subsequently cooled to 40 °C, and loaded onto the Affymetrix probe 
array cartridge. The probe array was then incubated tor 16 h at 40 °C. 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 *C followed by 4 washes in 0.5 x SSPE-T 
at 50 °C. The biotinylated cRNA was stained with a streptavidin- 
phycoerythrin conjugate, 10 ng/ml (Molecular Probes) in 6x SSPE-T 
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Fig. 1 — continued 



for 30 min at 25 °C followed by 1 0 washes in 6x SSPE-T at 25 °C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Atfymetrix by Hewlett-Packard). The readings 
from ihe quanlilative scanning were analyzed by Affymetrix gene 
expression analysis software. 

Microsatetlite Anafysis— Microsatellite Analysis was performed as 
described previously (14). Microsatellttes were selected by use of 
www.ncbi.nlm.nih.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume of 20 /J for 35 
cycles. The ampficons were denatured and electrophoresed (or 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor ampticons compared with blood. 

Proteomic Analysis— TCCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 *C until use. The procedure (or 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master two- dimensional gel 
image ol human keratinocyte proteins; see biobase.dk/cgi-bin/celis. 

CGH- Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein- labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 fig) were 
denatured at 37 "C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
the slides were count erst ained with 0.15 /xg/ml 4,6-diamidino-2-phe- 
nylindole in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluorescein-labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image anafysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
greenrred fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the greenrred fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization —The CGH analysis 
identified a number of chromosomal gains and losses in the 
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Table I 

Correlation between alterations detected by CGH and by expression monitoring 
Top. CGH used as independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable (if expression alteration - what CGH deviation was found). ^ ' 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance 



CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 



10 Loss 



10 Up-regulation 

0 Down- regulation 

3 No change 

1 Up-regulation 

5 Down- regulation 

4 No change 



77% 



50% 



10 Gain 



12 Loss 



8 Up- regulation 
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5 Loss. 
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two invasive tumors (stage pT1 , TCCs 733 and 827), whereas 
. the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+ ; 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24 + , 2q14.1-qter-, 3q12-q13.3-, 6q12-q22-, 
9q34 + , 11q12-q13+, 17 + , and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. 1A) and 
20q12 in TCC 827 (Fig. 1B). 

mRNA Expression in Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1 ,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non-invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fbld were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). tn most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2- fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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Fig 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (A) and 733 (♦ ) and their non-invasive 
counterparts 532 and 335^ The expression change was taken from the Expression line to the right in Fig. 1, which depicts the resulting 
expression change lor a given chromosomal region. At least hall of the mRNAs from a given region have to be either up- or down-regulated 
to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2). For both tumors TCC 733 (p.< 0.015),and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2). Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, boffom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatellite- based Detection of Minor Areas of Loss : 
es— In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32, 2p21, and 7q2Vand q32, 9q34, and 
10q22). to determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig.' 1/1). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21,1, and 16q24 
and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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FtG. 3. Microsatellite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility anligen (gene 
number 38 in Fig. 1), (b) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close . 
to general ^-spectrin (gene number 11 on Fig. 1) and of (cO tumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S1118 
close to milochondria! 3-oxoacyl-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
trom normal DMA from leukocytes (A/), and the tower curves show the 
electropherogram Irom tumor DNA (7). In all cases one artele is 
partially lost in the tumor amplicon, 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels— 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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FtG. 4. . Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated horizontal 
axis). The mRNA ratio, as determined by oligonucleotide arrays was 
plotted for each gene [vertical axis). A, mRNAs lhat were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non-invasive reference (fop of figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, arid TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (irom 
left), phosphoglucomutase 1, glutathione transferase class p. number 
4, fatty acid-binding protein homologue,. cytokeratin 15, and cyto- 
keratin 13; 8 (from left), fatty acid- binding protein homologue, 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; C (from left), or-eno- . 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-c, . and 
pre-mRNA splicing factor; D, mesoihelial keratin K7 (type II); E (trom 
top), glutathione S-transf erase- u and mesothelial keratin K7.(type II); 
F(from top arid /eft), adenytyl cyclase-associaled protein, E-cadherin, 
keratin 19. calgizzarin, phosphogtycerate mutase, annexin IV, cy- 
toskeletal -ractih, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translalionally controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na.K- 
ATPase 0-1 subunit; G, (from fop and left), TCP20, calgizzarin, 70- 
kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 19, triosephosphate isomerase, hnRNP F, liver gryc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-w, and keratin 8; H (from /eft), plasma gelsolin, autoantrgen cal- 
reticulin, thioredoxin, and NAD+ -dependent 15 hydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase 0-subunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin II; K, annexin IV; L (Irom top and /eft), 90-kDa heat 
shock protein, prolyl 4-hydroxylase 0-subunit, a-enolase, 6RP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non- invasive TCC 335 but lost in the invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene, expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general* the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 

Table II 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration* 


Protein alteration 


Annexin II 


1q21 


733 


Gain 


Abs to Pres* 


Increase 


Annexin IV 


2p13 


733 


Gain 


3-9-Fold up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3.8- Fold up 


Increase 


Cytokeratin 20 


17q21.1 


827 . 


Gain 


5.6-Fold up 


' Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 


FBP1 


9q22 


827 


Gain 


2. 3- Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibit in 


17q21 


827/733 


Gain 


3-7-/2.5-Fold up* 


Increase 


Prolyl- 4 -hydroxy! 


I7q25 


827/733 


Gain 


5.7-/1.6-Fold up 


Increase 


hnRNPBI 


7p15 


827 


Loss 


2.5-F'old down 


Decrease 



* Abs, absent; Pres, present. 

* In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 




Fig. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper pad ol the figure shows a 2D gel 
(/eft) and the oligonucleotide array [right) of TCC 532: The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below.* 
Clearly, cytokeratins 13 and 15 are strongly down- regulated in TCC 
827. (red annotation). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array {red arrow) from TCC 532 
and is compared wilh TCC 827. The upper row of squares in each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation {used for correction for un-, 
specific binding). Absence of signal is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
delected in TCC. 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure (left) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 finvasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
tected rn the case of TCC 335 (1277 units) whereas very low levels 
were detected in TCC 733 (166 units). IEF, isoelectric focusing. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these, transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors {e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proiine-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylation in tumor cells (17-19); Thus, it may be possible 
. that in chromosomes with increased DNA copy numbers two 
or more alleles could be .demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-; 9q~, 1q+, Y- 
(2, 6), and in pT1 tumors, 2q-,11p-, 11q-, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minimal invasive 
pTl tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 11q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases.* It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal cells and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translational 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D- PAGE (25), and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast. 

Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of. chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
. quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived. from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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Genome-wide study of gene copy numbers, transcripts, and protein levels 
in pairs of non-invasive and invasive human transitional cell carcinomas. 

Orntoft TF, Thykiaer TV Waldman FM . Wolf K r^re 

Department of Clinical Biochemistry, Molecular Diagnostic Laboratory Aarhus 
University Hospital, Skejby, DK-8200 Aarhus N, Denmark, orntoft@kba.sks.au.dk 

Gain and loss of chromosomal material is characteristic of bladder cancer as well as 
malignant transformation in general. The consequences of these changes at both the 

ranscnption and translation levels is at present unknown partly because of technical 
limitations. Here we have attempted to address this question in pairs of non-invasive and 
invasrve human bladder tumors using a combination of technology that included 
comparative genomic hybridization, high density oligonucleotide array-based monitorW 
of transcript levels (5600 genes), and high resolution two-dimensional gel m ° mt ° nng 
electrophoresis. The results showed that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This effect depended (p < 0 01 5) on the 
magnitude of the comparative genomic hybridization change. In general (18 of 23 cases! 
c^omosomal areas with more man 2-fold gain of DNA showed Lrresponin nc^S 

or Setr SCnP ^ ' 0SS ° f DNA ' ° n thC ° ther hand " showed -thef reduced 
or unaltered transenpt levels. Because most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and protein alterations in relatively 
few cases of well focused abundant proteins. With few exceptions we found a good 
correlation ( P < 0.005) between transcript alterations and protein levels. The implications 
as well as limitations, of the approach are discussed. mpucauons, 

PMID: 12096139 [PubMed - indexed for MEDLINE] 
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Cell localization and regulation of expression of cytochrome P450 1A1 and 
2B1 m rat lung after induction with 3-methylchoIanthrene using mRNA 
hybridization and immunohistochemistry. 

ggll^OJg L Irabe|gjN, BuardA, Fleury- Feith J , BacheletCM Pn^ft^P 
Brochard P. Laurent P . ^ 

INSERM Unite 139, Hopital Henri Mondor, Creteil, France. 

In order to characterize the response of various pulmonary cell types to polycyclic 
aromatic hydrocarbons, the expression of cytochrome P450(CYP) 1A1 and2Bl mRNA 
in the lung of rats, with or without induction by 3-methylcholanthrene (3MCj was 
analyzed by in situ hybridization using appropriate 35S-labeled riboprobes. The 
expression of the corresponding proteins was investigated immunohistochemically 
Following induction with SMC, the kinetics of mRNA expression differed considerably 
i^ff 1 C md **** 11 P neum o c ytes and venous endothelial cells. In Clara cells 
mRNA expression was detected as early as 1 h after induction, peaked between 2 and 4 h 
and was completely undetectable at 14 h. In contrast, venous endothelial cells and type II' 
pneumocytes exhibited permanent mRNA expression of CYP 1 Al in 3MC-pretreated 
rats These kinetic results explain the striking absence of correlation between mRNA and 
protem expression observed in Clara cells 24 h after the end of the induction protocol as 
these cells exhibited intense protein expression with no mRNA. In contrast a good 
correlation was observed for mRNA and protein expression of CYP 2B1 withsimilar 
e T eS ^ 0nS ^ ^ , ce,ls md 11 Pneumocytes, but no expression in endothelial 
cells. This study clearly distinguished the regulation of CYP 1 Al expression in the rat 
lung from that described in the liver. The differences observed in the various lung cell 
types whatever the post-transcriptional mechanisms involved, emphasize that studies 
must be performed at the cellular level in order to understand the specific response to 
xenob,otics, not only of this organ as a whole but also of its various anatomic structures. 

PMID: 7917307 [PubMed - indexed for MEDLINE] 
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DNA hypennetkylation is a mechanism for loss of expression of the HLA 
class I genes in human esophageal squamous cell carcinomas. 

NieY, YangjG, SongY, ZhaoX, SoC, IdMJ, Wau^, YangCS. 

Laboratory for Cancer Research, College of Pharmacy, Rutgers-The State University of 
New Jersey, 164 Frehnghuysen Road, Piscataway, NJ 08854-8020, USA. 

The three human leukocyte antigen (HLA) class I antigens, HLA-A, HLA-B and HLA-C 
play important roles in the elimination of transformed cells by cytotoxic T cells Frequent 
loss of expression of these antigens at the cell surface has been observed in many human 
cancers. Various mechanisms for post-transcriptional regulation have been proposed and 
tested bui the molecular mechanisms for transcriptional regulation are not clear. We show 
by immunohistochemistry that the HLA class I antigens are absent in 26 of 29 (89%) 
samp es of human esophageal squamous cell carcinomas (ESCC). Eleven of the 26 ESCC 
samples lost mRNA expression for at least one of the HLA genes, as shown by RT-PCR 
DNA from the 29 pairs of ESCC and neighboring normal epithelium were examined for' 
CpO island hypermethylation, homozygous deletion, microsatellite instability (MSD and 
loss of heterozygosity (LOH). DNA from normal epithelial tissues had no detectable 

,°? ° Cp ° i5lands of °f these gene loci. Thirteen of 29 ESCC samples 
(45 /o)exlubited methylation of one or more of the three HLA loci and six samples (21%) 
exhibited methylation of all three loci. The HLA-B gene locus was most frequently 
methylated (38%). HLA-B mRNA expression in an ESCC cell line, where HLA-B was 
hypermethylated and did not express mRNA, was activated after treatment with 5-aza-2'- 
deoxycytidine. Homozygous deletion of these three gene loci was not observed 

n?«~ tl!?* rat6S ° f L ° H 3nd MSI WCre 0bscrved for me microsatellite markers 
- D6S306, D6S258, D6S273 and D6S1666, close to the HLA-A, -B and -C loci, although a 

TiS! clt , W3S observed at a nearb y lQ cus (represented by the markers D6S 1 05 1 
and D6M560), where the tumor suppressor gene p21(Wafl) resides. A strong correlation 
between genetic alterations and mRNA inactivation was observed in the ESCC samples 
p^fu i " d L Cate that HLA C,ass 1 S ene expression was frequently down-regulated in' 
ESCC at both the protein and mRNA levels and that hypermethylation of the promoter 
regions of the HLA-A, -B and -C genes is a major mechanism of transcriptional 
inactivation. 
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understood Thus the present study investigates the expression of HER2/neu in 1 3 
pattents wtth synovial sarcoma (SS). In this study, HE^/neu mkNA^^L 

nh!S T' 1 membran °us and cytoplasmic patterns of immunostainine were 

ievd7- "on w g TSS was found behveen P rotein ex P ressi - S 

owe/risk of devdoZ ^ ^ IeVdS Were associated with a 

wwa nsk of developing recurrences (P = 0,02). Moreover, none of the patients with 

S^^S^r 10 ^ metaStaSiS " ° Ur d3ta dem ° nS *«^Sl is 
S^J^Sf ?° n m ^ ^P 1 * 5 ™ HBR2/neu expression 
Srir !' ? Ur ? UltS Sh ° W ^P-sence of increased levels of - 

S^^tteS^fl? " 3 m0re f T rab,e Clinical ^studies are 

EEL f he ^ of th >s oncogene tn SSs and to evaluate the application of 
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Molecular and Immunohistochemical 
Analysis of HER2/neu Oncogene in Synovial 
Sarcoma 

^OTA MD ' CATER,NA PELLEGRINI ' PhD ' 

KUBtKfA FASANI, MD, MARCO MAGGIONI MD 

?M!P2f OGGL MD ' ANTONINA PARAFIQRITI, MD 
AND SILVANO BOSARI, MD 



AmphBtMon and/or ovwexpression of HDtZ/nen have been 
documented tnmaoytypes of epitheliaj tumor and , ttold . bas ^ 

T™™ 35 ' P**"*™* ™ osteosarcomas . But the role of 
HEM/neu alterations u, soft .issue rumors remains poorly under- 
stood. TW the present sntdy investigates the egression ofMEBI/ 

12 lEjT? ^ SfDmU> sarconM (SS >- h ,h!s HER V 

neu mRNA levels we measured In frozen ^ ^ „. ^ 

real-nme reverse b^nscripdon-poryroerase chain reaction assay- r™. 
egression «s assessed by immnnohistocbemistry udr« an Li- 

HER2/neu polyclonal antibody. Six normal skeletal muscfe speci . 
mens jere used ,„ establish basal levels of MER2/nen TnRNA. 
I ttaMm P B w <= de '^'«l ™ all normal tissues and SSs 

w . K " COmaS d<OT °»»™«l HHW/neu mRNA levels 

above .he mean value, whereas 3 rumors (23%) displayed HER2/neu 
protem overexpression. Bod. membranous and cytoplasmic patterns 
of unmunostauting w„e observed, and a strong correjan™ ™ 

tum^T." 1 S3rC ° m r (SS) " 311 =W«s™ soft tissue 
tumor that accounts for up ,o 10% of sarcomas, with a 
peak modence in adolescents and young adults. TOs 
umor occurs ,„ 2 major forms. Diphasic and monopha- 
1RU n ^"""fc^Uy characterized by the ,(X; 

il,h P I? 1) ,! ransl r ation > found in >95% of cases 
Although tracbuonally considered to be a hiph-gn.de 
neoplasm, recent investigations have suggested I lb* dif- 
ferent factors tnfluence, prognosis, including morpho- 
og-cal and cytogenetic features, treatment strategies 
the plo.dy status, and the apoptolic index.' ' 
,„,K ^ evel °P m /" t ° r "<™ Lerapeutic advancement 

or«er!f ' F**"** ° f alterauons 

present m human mahgnancies, has brought to light ihe 
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found between protein expression and mRNA levd (P = o 0 I> 
Incr^ed lM/neu mRNA levels **re significant* associated with 
a lowers of devclopng recurrences (/ > = 0.02). Moreover, none of 
*e panen* ™b HER2/oeu overexpression deeped metastnsis, 
Our data dcmpnstraie .ha. HER2/neu is expressed in SS» and that 
^rP^? M i a l C7t0p,aSmiC "^/^ «P«^on correlate 
W^H^^ rCSlJ,S ^ the P^e of leased 

course. Further stud.es are needed to assess the role of this oncogene 

diSlI tC aPPUCad ° n ^^torjhuznani.ed n,^ 

don^ auUbod.es u. the treanueut regimens for this mahWan™ 

r^^^ 64 ^ ® FJ ^~ AH rights re^T 

reverse trauscripoon^lymerase chain reaction, SS, synodal sarcoma.' 

need to identify not only prognostic fectors, but also 
tumoTfcaturcs that axe predictive of response to therapy. 
One of the most exiensively studied molecular 

^ Se «^ r o^ eraPy iS ^ HER - ? /n™ P rotcK>ncogene. 
ine hU^K-2/neu oncogene (also Itno^n ^ c -e r b&-2) 
located on chromosome I7q2l, is a member of the 
tyrosine kinase receptor family and encodes for a 185- 
kilodalton protein that shows 50% homology with the 
epidermal growth factor receptor. 2 ^ T^is gene is am- 
plified and/or overexpressed in"20% to 30% of breast 
carcinomas - and in-various other tumors, 6 and usually 
is associated with tumor aggressiveness and poor prJ- 

™£o/ CJaI StUdieS W ra PPO«cd the value of 
HfcR-2/neu to predict the response to chemotherapy m 
breast cancer^nd the use of recombinant humanized 
an u bodies to HER-2/neu protein (Tras t u,umnb) in the 

care of patients with advanced, metastatic breast tumors 
nas been approved. 

The role of HER-2/neu aciivaiion in soit tissue 
tumors remains poorly understood, and scarce molec- 
ular data backing immunohistochemical sludjes have 
been reported. HER-2/neu protein expression was im- 
munohi Sl ochemically studied m 204 sarcomas, includ- 
ing b SSs, and overexpression was absent in all these 
majjgriant mesenchymal neoplasms 10 

Recently, HER-2/neu alterations have been de- 
scribed m osteosarcoma, *iih a high incidence of pro 
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TABLE 1. Qinicopathologic Features of 13 Patients With SS 



No. 


Agc(y)/G 


Site 


Subtype 


Surgery 


Chemo/Radio 


Rec 


Mets 


HER2 


FU(mo) 


1 


68/F 


Kl 


MF 


RR 


No 


Yes 


Yes 


L 


108 


2 


71/F 


KE 


. BF 


WA 


No 


No 


No 


L 


12 


3 


27/M 


Kl 


. BF 


ME 


Yes 


Yes 


Yes 


L 


24 


4 


39/M 


T 


MF 


WE. 


Yes 


No 


No 


H 


48 


5 


15/M 


N 


MF,PD 


WE 


Yes 


No 


No 


H 


48 


G 


41/F 


F 


MFJ>D 


WA 


Yes 


No 


■ No 


H 


48 


. 7 


57/F 


Kl 


MF 


WA 


No 


N/A 


N/A 


L 


N/A 


8 


48/M 


F 


BF,PD 


ME 


Yes 


Yes 


No 


L 


36 


9 


29/F 


T 


PD 


WE 


Yes 


Yes 


Yes 


L 


36 


10 


27/M 


KE 


BF 


WE 


No 


No 


No 


H 


24 


ii 


62/F 


A 


MF 


WE 


No 


Yes 


No 


L 


24 


12 


62/F 


F 


MF 


WA 


No 


No 


No 


L 


2 


13 


34/F 


L 


MF 


RR 


No 


No . 


■ No 


L 


2 



Abbreviations: Age, age at diagnosis; G,. gender; M» male; F„ female; Site, anatomic location; Kl, knee, intra-articular; KE, knee, 
extra-articular; T, thigh; N, neck; F, foot; A, arm; L, leg; Subtype, histological subtype; MF, roonophasic fibrous; BF, biphasic; MF-PD, 
monophasic fibrous with poorly differentiated areas; BF-PD, biphasic with poorly differentiated areas; PD, poorly differentiated; Surgery, 
primary surgical therapy; RR, radical en bloc resection; WA wde through-bone amputation; ME, marginal en bloc excision; WE^wide en bloc 
excision; Chemo/Radio, adjuvant postoperative chemotherapy and/or radiotherapy; Rec, recurrence; Mets, presence of metastasis; HER2, 
HER2 mRNA expression; L, low expression; H, high expression; FU, follow-up status; N/A, not avaible. 



tein expression, ranging from 42% to 61 %. 13 " 15 Indeed, 
despite limited information on Her2/neu in this type 
of malignancy, based mostly on immunohistocliemicaJ 
findings, 2 clinical trials of Trastuzumab have been 
initiated for recurrent and metastatic osteosarcoma pa- 
dents (http://w\vw.cancer.gov/clinical_triaJs: MSKCO 
99097/NCJ-T98-0083 and COG-AOST01 21 ) . 

Therefore, we evaluated the rriRNA expression and 
the gene product of HER-2/neu in 13 SS patients using 
real-time reverse transcription-polymerase chain reac- 
tion (RT-PCR) and immuriohistochemistry. To the best 
of our knowledge, this is the first report documenting 
increased levels of HER-2/neu mRNA and protein 
in SS. 



MATERIALS AND METHODS 

Patient Population 

Thirteen patients with primary SSs, obtained from the 
files of the Department of Pathology, Gaelano Pini Orthope- 
dic Institute,, were included in this study. Cases were chosen 
based on the availability of frozen primary rumor. Patient age 
ranged from 15 to 71 years (mean, 44.6 years). Anatomic sites 
included the knee (5 patients, 3 intraarticular), thigh (2 
patients), foot (3 patients), arm (3 patient), neck (1 patieDt), 
and lower leg (1 patient). The histological svbrj'pes were 3 
biphasic (BF), 6 monophasic fibrous (MF), 2 monophasic 
fibrous with poorly differentiated areas (MF-PD), 1 biphasic 
with poo try differentiated areas (BF-PD), and 1 poorly differ- 
entiated (PD). Clinical staging was 1JB for all the patients. 
Local surgical excision was performed in 9 patients; amputa- 
tion, in 4 patients. Recurrence was observed in 5 cases with 
subsequent amputation (3 patients) and local excision (2 
patients). Metastasis occurred to lung (2 patients) and ingui- 
nal lymph nodes (1 patients). Five patients received adjuvant 
postoperative chemotherapy, and ] patient (case 4) was 
neated with chemotherapy associated with radiotherapy. Fol- 
low-up ranged from 2 to 108 months (mean, 34.3 months). 
Qinicopathologic data are summarized in Table 3. 



Pathologic Studies and Frozen Tissue 
Selection 

In all cases, the primary tumor was available for study. 
Surgically resected tumor tissues were partly snap-frozen in 
liquid nitrogen and stored at -80°Cfor RNA extraction, and 
partly fixed in buffered formalin and embedded in paraffin 
blocks. Hematoxylin and eosin-stained sections were re-eval- 
uated and graded according to FNCLCC grading system. 16 
Frozen tissue blocks were handled as follows: 4-/nhick frozen 
sections were cut and stained with hematoxylin and eosin to 
determine the percentage of tumor cells present in the spec- 
imen. We used tissue blocks with tumor cells comprising 
more than 80% of the specimen. About 1 0 20-u-ibick sections 1 
were collected into Eppendorf tubes. Another 4-u.-tbick fro- 
zen section was cut after the serial sections and examined by 
light microscopy to guarantee the percentage of tumor cells 
collected. Trizol (Life Technologies; Gibco BRL, Gait he rs- 
burg, MD) was used for RNA extraction, according to the 
manufacturer's protocol. RNA was quantified spectrophoto- 
metrically. 



cDNA Synthesis 

Total RNA (200 ng) was reverse-transcribed in a total 
volume of 50 uX containing 1 X TaqMan buffer, 5.5 mmol 
MgClj, 1 mmol deoxynucleoiides, 2.5 u.moI random hexam- 
ers, 20 U RNase inhibitor, and 62.5 U MuLv reverse transcrip- 
tase. The samples were incubated at 25°C for 10 minutes, 
18*C for 30 minutes and 95°C for 5 minutes. 



PCR Amplification 

Amplification reactions were performed \vith the Univer- 
sal TaqMan 2 X PCR mastermix in a volume of 25 uL con- 
taming 300 nmo! of each primer, 100 nmol of probe, and 5 
uL of cDNA: Both 0-acrin and HER2/neu amplification weie 
done in duplicate for each sample. 

The thermal cycling conditions included 2 minuies*at 
50 L 'C and 30 minutes at i>5°C, followed by 40 cycles of 95^C for 
35 seconds and BOX for 1 minute. All reagents used for 
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RT-PCR were purchased from Applied Biosystems (Foster 
City, CA). 

Primers ond Probes 

Prime™ and probes for >acbn and HER2/neu mRNA 
were chosen using the computer program Primer Express 
(Applied Biosystems)'. Sequences of the forward primer for 
HER2/neu mRNA (CenBank accession number X03363) 
were 5*-TCC TCT GTG CAC CTG CAT CAC3' and the 
reverse pnmer 5 r -CCA AAG ACC ACC CCC AAC A-3'; the 
sequence of the TaqMan probe was 5'(FAM)-ACC ACA ATC 
CCA ACC ACC GCA CA-<TAM*A)-3\ Sequences of the for- 
ward pnmer for fkicrin mRNA (Gen Bank accession number 
X00351) were ^TCC TTC CTC GGC ATG GAG-3' and the 
reverse primer 5'-ACG ACC ACC AAT CAT CTF GAT CTT- 
3'; the sequence of the TaqMan probe was57FAM)-CCTGTG 
GCATCC ACC AAA OTA CCT TC-(TAMRA)3\ Probes were 
purchased from Applied Biosystems. 

Real-Time RT-PCR 

To measure HER2/ neu expression in these tumors we 
used a real-ume quantitative RT-PCR based on TaqMan meth- 
odology, as previously described, 17 with minor modifications. 
Briefly, this technique allows, by means of fluorescence emis- 
sion, to find the cycling point when PCR product is detectable 
(Ct value or threshold cycle). As previously reported, the Ct 
value correlates to the starting quantity of the target mRNA. ls 
To normalize the amount of total RNA present in each reac- 
tion-, we amplified the housekeeping gene J^ctin, which is 
assumed to be constant in both normal samples and rumor 
tissues. 

Our results are expressed as relative levels of HER2/neu 
mRNA, referred to a sample, called a "calibrator," chosen to 
represent IX expression of this gene. The calibrator was a 
breast cancer cellular line (MCF-7) 1 * that was analyzed on 
every assay plate with the unknown samples. All of the ana- 
lyzed tumors expressed n-fold HER2/neu mRNA relative to 
the calibrator. 

The amount of target, normalized to an endogenous 
reference (0-actin) and relative to the calibrator, was defined- 
by the AACt method as described by Livak K (Sequence 
Detector User. Bulletin 2; Applied Biosystems). Specifically 
the formula is applied as follows: * 



. 25,0 



target amount = 2"^°' 

where AA C, = ||Ct (HER2/neu sample) - Ct (0-actin 
sample)] - [C< (HER2/neu calibrator) - Ct (^crin calibra- 
tor)]}. . 

, Immunohistochemistry 

Formalin-fixed, paraffin-embedded tissue sections were 
deparaffimzed, lehydnued, and exposed to the primary anti- 
body using the EnVision+ sys.em (Dako, Carpin.eria, CA) 
Primary ano-HFR-2/neu antibody (rabbit polyclonal ami- 
^ y onn m !° S num ^ er A0485; Dako) wns appiied ,n a diluoon 
of 1:2000 foi 60 minutes at ioom temperature. Before expo 
sure to the primary antibody, sections were miciowave-pie- 
treaicd m EDTA, P H S.O, to retrieve antigenicity, and incu- 
bated wuh endogenous peroxidase-blocking solution for 10 
mmutesat room temperature. Positive control, constituted bv 
? „v " Icmoma lowing more than 80% posinve staining 
for HtR2/neu, as well as negabve control, in which the 
primary antibody was omitted, were stained in parallel. 




normal tissue n=6 synovia) sarcomas syncn/iaJ sarcomas 
(tow expreEscfi) (h^h expression) 

FIGURE 1. Distribution of HER2/neu mRNA levels" In normal 
muscle tissues, ond in low- and higrvexpression sarcomas. 
Data are expressed as mean and standard error of the mean 
for eocn group. 



All cases were examined for both cytoplasmic and mem- 
brane immunoreactivity. Cytoplasmic staining was evaluated 
on a semiquantitarive scale, according to Kilpatrict et al with 
minor modifications, 20 and reported as 0 (no staining or 
staining in <10% of cells). 1+ (weak staining in >10% of 
cells), 2+ (moderate staining in >10% of cells), or 3+ 
(strong staining in >10% of celb). The presence of a mem- 
branous pattern of staining was recorded separately and 
scored as absent, (no staining or weak staining in <10% of 
cells) or present (complete and/or incomplete staining in 
>107o of cells). Tumors with a cytoplasmic score of 3+ were 
considered to have high HER-2/neu protein expression. 



Stotisticol Analysis 

Statistical differences were calculated by Fisher's exact 
test. The Hest method was used to evaluate the differences 
between groups. Differences were considered statistically sig- 
nificant when Pwas <0.05. 



RESULTS 

HEP2/neu rnRNA Evaluation 

All of the tissues analysed contained delectable 
levels of H£R2-nev mRNA. Sir norma) tissue samples 
(skeletal muscle) were used to establish basal level of 
HER2/neu mRNA. All the normal samples expressed 
very low levels of HER2/n'eu mRNA, ranging from 0.9 
to 1.9 n (mean, 1.4 n). Among the 13 tumor samples, 
HER2/neu levels varied greatly, ranging from 2.1 to 
2*1 n . Setting a cmofT level at 7.9 n (a value that repre- 
sents the mean vaJue of expression distribution of the 
SSs), 9 cases (69%) had low HER2/neu expression and 
4 cases (31%) had high HER2/neu expression (Fig 1; 
Table 1). The difference between the 2 groups (low 
and high HER2/neu tumors) was statistically significant 
(P= 0.0001). 
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FIGURE 2. Immunohistochemicol locoltzotion of HER2/neu in SS. (A) Positive control (breast cancer) showing typical strong 
membrane pattern of positivity. (B) Case 2, a biphosic SS, displaying very tocal membrane staining limited to a gland. Inset: focal 
weak cytoplasmic positivity in the epithelial component of the same case. (C) Case 10. the epHhelJal component of this biphosic 
SS. displaying strong membrane pattern of staining. (D) Case A. an epithelioid area in a monophaslc fibrous SS. showing strong 
cytoplasmic positivrfy. 



HER2/neu Protein Expression 

Staining with HER2/neu antibody revealed a vari- 
able cytoplasmic and membrane staining pattern. 
Three tumors (23%) showed strong staining involving 
both the cell membrane and the cytoplasm (cases 4, 5, 
and 10); weak to moderate, exclusively cytoplasmic 
staining was observed in 7 cases (cases }, 3, 6, 7, 8, 9, 
and 1 ]). No staining was detected in 2 tumors (cases J 2 
and ]3). In 1 case (case 2), a cluster of glandular 
structures representing <5% of the nimor showed 
weak cytoplasmic and very focal membrane staining, 
the latter limited to a single gland; The epithelial/ 
epithelioid components exhibited stronger, cvioplasmic 
staining compared with die spindle-cell component of 
the tumors. Membrane staining was predominantly in- 
complete and limited to the epithelial/epithelioid ar- 
eas. AJ1 3 cases with high immunohistochemical expres- 
sion of HER2/ncu (cases 'I, 5, and 10) were grade LI 3 
sarcomas, including 1 MF, ] R¥ r and 1 MF-PD SSs. 



Examples "of" HER2/neu cytoplasmic and membrane 
staining are depicted in Figure 2. 

Correlation of Molecular ond 
Immunohislochernical Results 

A strong, statistically significant association was 
present between protein expression, for both mem- 
brane and cytoplasmic staining and HER2/neu rnRNA 
levels {P = 0.0] ), although 1 case (case 6) displayed 
discordant results. Interestingly, this neoplasm showed 
high >]ER2/neu rnRNA levels, whereas only weak stain- 
ing, limited to the cytoplasm of a minority of rumor 
cells, was detected by immunohisiochemical analysis. 

HER2/neu Expression ond Clinicopathologic 
Parameters 

Both HER2/neu protein expression and rnRNA 
levels were evaluated to establish the relationships to 
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TABLE 2. Correlation Behyeen Clintcopathologfc 
Features and HER2/neu Expression as Detected by 
IHC and RT-PCR 



HER2/neu 
IHC PGR 



Variable 



P value 



lvalue 



3 3 
7 0 



NS 



3 3 
6 1 



8 0 7 1 

2 3 0.03 2 3 

3 2 3 2 
7 1 NS 6 2 



NS 



NS 



NS 



3 0 3 0 

7 3 NS 6 4 NS . 



NS 



Age (years) 
<40 
>40 
Sex 
Female 
Male 

Tumor size (cm) 
<5 
>5 

Histological grade 
II 

in 

Histological type 

MF 5 1 5 1 

. BF 2 1 2 1 

H> 3 1 NS 2 2 

Cbem ©/Radiotherapy* 
Yes .42 33 

No ■ 5 1 NS 5 1 NS 

Recurrencef 

Yes 5 0 5 0 

No 2 3 NS 1. 4 0.02 

■ Metastasisf 

Yes '30' 30 

^ 4 3 NS 3 4 NS 

Abbreviations: U low expression; H, high expression; NS, not 
significant; MF, monophasic fibrous; BF, bipbasic; PD, poorly differ- 
entiated (including MF and BF with poorly differentiated areas), 
information not available for case 7. 
tCases 7, 12, and 33 were excluded from the analysis. 

clinicopathologic features, including local recurrence 
and metastatic disease. Two cases (cases 1 2 and 1 3) with 
follow-up less than 12 months and 1 case (case 1) for 
which clinical information was not available were ex- 
cluded from the analysis of recurrences and metastatic 
behavior. 

No correlation was observed between HER2/neu 
rnRNA expression and age, sex, tumor size, tumor 
grade, histotype, and metastasis. A correlation between 
sex of the patients and HER2/neu protein expression 
was found. In fact, none of the female patients showed 
high HER2/neu protein expression {P = 0.03). Pa- 
tients with high Her2/neu mRNA levels had a lower 
risk of recurrence • than those with low Her2/neu 
mRNA levels (P = 0.02). None of the cases with high 
HER2/neu mRNA levels developed metastatic foci, al- 
though the small number of observations precluded 
reaching statistical significance (P = 0.1). Results are 
detailed in Table 2. 

DISCUSSION 

The present work provides the first combined mo- 
lecular by real-time RT-PCR and immunohistochemical 
evidence that HER2/neu over expression occurs in SSs. 



Our results indicate that this parameter may provide 
prognostic information and suggest that a specific ther- 
apy with humanized monoclonal antibodies against 
HER2/neu may be considered in a significant number 
ofSSs. 

The HER2/neu oncogene has been .extensively 
investigated as.a prognostic factor and more recently as. 
a predictor of response to therapy. It has been demon- 
strated in breast cancer, where HER2/neu overexpres- 
sion is usually associated with gene ampliffcarion, 2 'and 
in other epithelial tumors, including ovarian, gastric, 
lung, and urinary bladder carcinomas. 

HER-2/neu amplification/ overexpression appears 
to be an early event in oncogenic transformation by 
interacting with other members of the HER family. 3 In 
breast cancer, it is involved in cell cycle and apoptotic 
pathways through the antiapoptotic effects mediated by 
p53 and p21 deregulation/ 2 '* 3 

Whether HER2/neu overexpression plays an im- 
portant role in mesenchymal neoplasms remains con- 
troversial. An immunohistochernical study of sarcomas; 
using a monoclonal antibody, reported no evidence of 
immunoreactivity for HER-2/neu in 6 SSs as well as in 
other 197 mesenchymal tumors, with cytoplasmic reac- 
tivity observed only in 1 case of peripheral neuroepi- 
thelioma.. A recent investigation reported gene ex- 
pression profiles of 41 soft tissue tumors with cDNA 
microarray analysis. Among these sarcomas, 6 
monophasic SSs were characterized by a unique expres- 
sion pattern of a cluster of 104 genes, including the 
epidermal growth factor receptor, which shows 50% 
homology with the HER2/ neu gene. 24 These data also 
suggest that the erb-B receptor family plays a significant 
role in SS. It has been demonstrated that a variable 
number of osteosarcomas over express HER2/neu. 11-15 
However, more recent studies 20 *^ 5 ' 26 were unable to 
detect any HER2/neu gene amplification and/or over- 
expression using fluorescence in situ hybridization 
(FISH), RT-PCR, and immunohistochemistry. 

Differences in the techniques used may play an 
important role and explain (at least in part) these 
discrepancies. HER2/ neu alterations can be evaluated 
using different techniques including irnmuno histo- 
chemistry, FISH, Southern hybridization, Northern 
blot, and competitive, differential, or real-time PCR. 27 
Immunohistochemistry is the most common method 
for detection of HER2/neu overexpression, but it is 
significantly affected by the sensitivity and specificity of 
the antibodies used, the type of tissue (frozen versus 
formalin-fixed), and the various interpretative criteria 
and scoring systems used to evaluate cases. Indeed, 
most studies of HER2/nen expression in osteosarcoma 
used immunohistochernical techniques, with different 
monoclonal or polyclonal antibodies. The discrepancy 
in results may stem from the use of different antibodies, 
as well as a lack of standardized evaluation. 

For these reasons, to evaluate HER2/neu immu- 
noreactiviry in our study, we used a polyclonal antibody 
(Dako, Carpinteria, CA), arguably the most diffuse and 
thoroughly tested antibody for HER2/neu assessment. 
Furthermore, we investigated HER2/ncu mRNA ex- 
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pression with real-time RT-PCR, because it has been 
demonstrated that mRNA levels correlates tightly with 
protein expression. 28 At present, real-time RT-PCR 
probably represents the most powerful tool for quanti- 
tative analysis, because it allows better internal control 
and reduction of sample contamination, and provides 
more objective results. 18 

We analyzed HER2/neu gene expression at the 
mRNA and protein level in 13 cases of SS. HER2/neu 
expression was found in all of the cases investigated, 
and mRNA content in the tumors varied from 2.1 to 
24 n. 

The variability of mRNA levels in SSs is reflected 
on heterogeneity of protein expression pattern as de- 
tected by immunohistochemistry. We found that 
HER2/neu immunoreactivity correlates strongly with 
mRNA levels. A convincing cytoplasmic immunoreac- 
tivity was documented with the polyclonal antibody in 
10 of 13 sarcomas. Distinct membranous staining was 
observed in 3 cases, although it was never comparable 
to the positive breast control. It was predominantly 
incomplete and identified in the epithelial/epithelioid . 
component of SS. Interestingly, all of the cases with 
. strong cytoplasmic staining also exhibited a membrane- 
staining pattern. 

In breast cancers, a membranous pattern, of stain- 
ing is thought to be specific for HER2/neu protein 
expression and correlated with gene amplification, 
whereas cytoplasmic staining is usually considered non- 
specific. 29 However, cytoplasmic posirivity for HER2/. 
neu has been reported to be prognosticaJly significant 
in other tumor types* including bladder, colon, pan- 
creas, thyroid, and nasopharyngeal carcinomas, and 
even in breast cancer. 3 

Patients with high HER2/neu mRNA expression 
had a significantly lower risk of recurrence. Similarly, 
all of the cases with high HER2/neu expression did not 
metastasize, although this correlation did not reach 
statistical significance due to the small number of cases 
studied. These data suggest that HER2/neu plays a role 
in the biology of SS and that HER2/neu overexpression 
may be linked to a less-aggressive clinical behavior. 
Indeed, unlike many cancers where MER2/neu overex- 
pression has been shown to correlate with poor prog- 
nosis, HER2/neu levels are linked to a more favorable 
clinical course in other malignant rumors, such as thy- 
roid carcinoma and osteosarcoma. 15 " 34 

The molecular mechanisms responsible for the 
action of HER-2/neu in SSs are unknown. A possible 
interaction between HER-2/neu and the other mem- 
bers of HER family could be important in tumorigen- 
esis. Derangements of other oncogenes, tumor suppres- 
sors, and apoptosis regulators have been described. in 
SSs. Fot instance, many SSs have been shown to be 
diffusely positive for be 1-2 family proteins (be 1-2, bax, 
bcl-x, and bak). These members of the bcJ-2 family are 
involved in the regulation of apoptosis in SS. 36 Tins 
raises the hypothesis that complex alterations in apop- 
tosis-controlling mechanisms are present in these neo- 
plasms, with HER-2/neu interacting with Be 1-2 family 



members. Further studies, are needed to clarify the 
mechanisms of apoptosis in SS. 

Depending on the size and location, the therapy of 
choice for SS is radical local excision or amputation., 
Whenever radical surgery cannot be performed, radio- 
therapy in concert with local excision is suggested in an 
attempt to avoid the need for amputation. Only re- 
cently has a study evaluated the possible role of chemo- 
therapy in the treatment of SS. 

The role of molecular markers in predicting treat- 
ment responsiveness is currently the focus of extensive 
investigation. Breast cancer patients with high HER2/ 
neu expression appeared to benefit from high-dose 
CAF (cyclophophamide, adriamycin, and 5-fluoroura- 
cil) therapy. 38 In our study, 3 of 4 patients with high 
HER2/neu expression received adjuvant chemother- 
apy with ifosfamide; these patients had a favorable clin- 
ical outcome. These data raise the possibility that 
HER2/heu may have value in predicting which patients 
are likely to respond to specific adjuvant chemotherapy 
regimens. Whether the favorable significance of 
HER2/neu expression depends on predicting clinical 
recurrence, response to chemotherapy, or both re- 
mains to be fully elucidated in SS patients. 

To the best of our knowledge, this is the first 
report that shows expression of HER2/neu in primary 
SS by real-time RT-PCR. Tlevated levels of HER2/neu 
mRNA and protein are found in a significant group of 
SS patients, and these levels appear to correlate with 
features of jgood prognosis. Furthermore, our results 
suggest that this mechanism of disease in SS may be the 
target of specific inhibitory therapies based on human- 
ized monoclonal antibodies. Considering the small 
number of patients examined, further investigation is 
needed to confirm these preliminary findings. 
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Expression of bcr-abl niRNA iu individual chronic myelogenous leukaemia 
cells as determined by iu situ amplification. 

Pachmanu IC Zhao S , Schenk T . Kantarjtan H . El-Naggar AtC Siciliano MX Guo 
JQ , ArLinghaus RB . And reef f M . 

The University of Texas M.D. Anderson Cancer Center, Department of Molecular 
Hematology and Therapy, 1515 Holcombe Boulevard, Houston, TX 77030, USA. 

We present the results of a novel method developed for evaluation of in situ 
amplification, a molecular genetic method at the cellular level. Reverse transcription 
polymerase chain reaction (RT-PCR) was used to study bcr-abl transcript levels in 
individual cells from patients with chronic myelogenous leukaemia (CML). After 
hybridizing a fluorochrome-Iabelled probe to the cell-bound RT-PCR product, bcr-abl 
raRNA-positive cells were determined using image analysis. A dilution series of bcr-abl- 
positive BV173 into normal cells showed a good correlation between expected and actual 
values, hi 25 CML samples, the percentage of in situ PCR-positive cells showed an 
excellent correlation with cytogenetic results (r = 0.94, P < 0.000 1 ), interphase 
fluorescence in situ hybridization (FISH) (r = 0.95, P = 0.001) and hypermetaphase FISH 
(r - 0.8 1, * P < 0.001). The fluorescence intensity was higher in residual CML cells after 
interferon (f FN) treatment than in newly diagnosed patients (P - 0.004), and was highest 
in late-stage CML resistant to IFN therapy and lowest in CML blast crisis (P = 0.001). 
Mean fluorescence values correlated with bcr-abl protein levels, as determined by 
Western blot analysis (r = 0.62). Laser scanning cytometry allowing automated analysis 
of large numbers of cells confirmed the results. Thus, fluorescence in situ PGR provides a 
novel and quantitative approach for monitoring tumour load and bcr-abl transcript levels 
in CML. 
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Expression of somatostatin receptor types 1-5 in 81 cases of 
gastrointestinal and pancreatic endocrine tumors. A correlative 
immunohistochemical and reverse-transcriptase polymerase chain 
reaction analysis. 

Papotti M, Bongiovanni ML Volnnte M . AHia £, Laadolfi S , Helboe L , Schindier M . 
Cole SL , Bussolari G . 

Department of Biomedical Sciences and Oncology, University of Turin, Via Santena 7, 
1 0126 Turin, Italy, mauro.papotti@unito.it 

Somatostatin receptors (SSTRs) have been extensively mapped in human tumors by 
means of autoradiography, reverse-transcriptase polymerase chain reaction (RT-PCR), in 
situ hybridization (ISH) and immunohistochemistry (IHC). We analyzed the SSTR type 
1-5- expression by means of RT-PCR and/or IHC in a series, of 81 functioning and non- 
functioning gastroenteropancreatic (GEP) endocrine rumors and related normal tissues. 
Moreover, we compared the results with clinical, pathological and honnonal features. 
Forty-six cases (13 intestinal and 33 pancreatic) were studied for SSTR 1-5 expression 
using RT-PCR, IHC with antibodies to SSTR types 2, 3, 5 and ISH for SSTR2 mRNA. 
The vast majority of tumors expressed SSTR types 1, 2, 3 and 5, while SSTR4 was 
detected in a small minority. Due to the good correlation between RT-PCR and IHC data 
on SSTR types 2, 3, and 5, thirty-five additional GEP endocrine tumors were studied with 
IHC alone. Pancreatic insulinomas had an heterogeneous SSTR expression, while 100% 
of somatostatinomas expressed SSTR5 and 100% gastrinomas and glucagonomas 
expressed SSTR2. Pre-opcrative biopsy material showed an overlapping 
immunoreactivity with that of surgical specimens, suggesting that the SSTR status can be 
detected in the diagnostic work-up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a reliable tool to detect SSTR types 
2 3 3 and 5 in surgical and biopsy specimens. ' 
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Correlative iainuiiiohistociieniical ami reverse transcriptase polymerase 
chain reactiou analysis of somatostatin receptor type 2 in neuroendocrine 
tumors of the lung. 

Papotti M , CroceS, Macri L . Funaro A , Pccchioni C , Schindlcr ML Bussolati G . 

Department of Biomedical Sciences and Oncology, University of Turin, Italy. 

Somatostatin receptors type 2 (sst2) have been frequently detected in neuroendocrine 
rumors and bind somatostatin analogues, such as octreotide, with high affinity. Receptor 
autoradiography, specific mRNA detection and, more recently, antisst2 polyclonal 
antibodies are currently employed to reveal sst2. The aim of the present study was to 
investigate by three different techniques the presence of sst2 in a series of 26 
neuroendocrine tumors of the lung in which fresh frozen tissue and paraffin sections were 
available. It was possible, therefore, to compare, in individual cases, RNA analysis 
studied by reverse transcriptase polymerase chain reaction (RT-PCR), in situ 
hybridization (ISH), and immunohistochemistry. A series of 20 nonneuroendocrine lung 
carcinoma samples served as controls. RT-PCR was positive for sst2 in 22 of 26 samples, 
including 15 of 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 small-cell 
carcinomas. The sst2 mRNA signal obtained by RT-PCR was strong in the majority 
(87%) of typical carcinoids and of variable intensity in atypical carcinoids and small-cell 
carcinomas. A wealcly positive signal was observed in 5 of 20 control samples. In 
immunohistochemistry, two different antibodies (ariti-sst2) were employed, including a 
monoclonal antibody, generated in die Department of Pathology, University of Turin. In 
the majority of samples a good correlation between sst2 mRNA (as detected by RT-PCR) 
and sst2 protein expression (as detected by immunohistochemistry) was observed. 
However, one atypical carcinoid and one small-cell carcinoma had focal immunostaining 
but no RT-PCR signal. ISH performed in selected samples paralleled the results obtained 
with the odier techniques. A low sst2 expression was associated with high grade 
neuroendocrine tumors and with aggressive behavior. It is concluded that I) 
neuroendocrine tumors of the lung express sst2, and there is a correlation between the 
mRNA amount and the degree of differentiation; 2) immunohistochemistry and ISH are 
reliable tools to demonstrate sst2 in these tumors; and 3) sst2 identification in tissue 
sections may provide information on the diagnostic or therapeutic usefulness of 
somatostatin analogues in individual patients with neuroendocrine tumors. 
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Correlative Immunohistochemical and Reverse 
Transcriptase Polymerase Chain Reaction Analysis of 
Somatostatin Receptor Type 2 in Neuroendocrine 
Tumors of the Lung 



Aola Funaro, Ph.D:, Carla Pecchiorii, Marcus Schindler, M.D., and 
Gianni Bussolati, F.R.c.Path. 



Somatostatin receptors type 2 (sst2) have been frequently de- 
lected in neuroendocrine tumors and bind somatostatin ana- 
logues, such as octreotide, with high affinity. Receptor autora- 
diography, specific mRNA detection and, more recently, ami- 
5 sst2 polyclonal antibodies arc currently employed to reveal 
* ssi2rThe aim of the present study was to investigate by three 
" different techniques the presence of sst2 in a series of 26 neu- 
roendocrine tumors of the Jung in which fresh frozen tissue and 
' paraffin sections were available. It was possible, therefore, to 
compare, in individual cases. RNA analysis studied by reverse 
transcriptase polymerase chain reaction (RT-PCR), in situ hy- 
bridization (JSH), and irruT)unoWstc«;hernistry. A series of 20 
Donneuroendocrine lung carcinoma samples served as controls. 
RT-PCR was positive for sst2 in 22 of 26 samples, including 15 
of. 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 
small-cell carcinomas. The sst2 mRNA signal obtained by RT- 
PCR was strong in the majority (87%) of typical carcinoids and 
of variable intensity in atypical carcinoids and small-cell car- 
cinomas. A weaiJy positive signal was observed in 5 of 20 
control samples. In iromunohistochemistry, two different anti- 
bodies (anti-sst2) were employed, including a monoclonal an- 
tibody, generated in the Department of Pathology, University 
of Turin. In the majority of samples a good correlation between 
sst2 mRNA (as detected by RT-PCR) and sst2 protein expres- 
sion (as detected by immunohistochemisoy) was observed. 
However, one atypical carcinoid and one small-cell carcinoma 
had focal unmunostaining but no RT-PCR signal. ISH petr 
formed in selected samples paralleled the results obtained with 
the other techniques. A low sst2 expression was associated with 
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high grade neuroendocrine tumors and with aggressive behav- 
ior. It is concluded that 1) neuroendocrine rumors of the lung 
express sst2, and there is a correlation between the mRNA 
amount and the degree of differentiation; 2) irnrnunohistochem- 
istry and ISH are reliable tools to demonstrate sst2 in these 
tumors; and 3) ssi2 identification in tissue sections may provide 
information on the diagnostic oi therapeutic usefulness of so- 
matostatin analogues in individual patients with neuroendo- 
crine tumors. 

Key Words: Neuroendocrine— Lung— Tu mors—Somato- 
statin receptors— Immunohistochemistry— Small cell carci- 
noma — Reverse transcriptase polymerase chain reaction. 

Diagn Mol Pathol 9(1): 47-57, 2000. 



The somatostatin receptor family (sst) includes at least 
five isofonns that have been recently identified and char- 
acterized (18,32,41). The ssts are widely distributed in 
norma] human tissues and in human tumors. Sst type 2 is 
more commonly detected in neuroendocrine tumors 
(32,37) and binds the somatostatin analogue octreotide 
with high affinity. 

Sst localization had originally been demonstrated by 
means of binding assays of radiolabeled somatostatin 
analogues (20,25,31). Subsequently, specific sst messen- 
ger RNA (rnRNA) detection was obtained by means of in 
situ hybridization (ISH) and reverse transcriptase poly- 
merase chain reaction (RT-PCR) (14.32,37). Recently; 
polyclonal antibodies specific for different isoforms of 
sst were produced and used in ixrimunohisiocherjustry 
(10,12,15,1830,35,36)- Given the well-known heteroge- 
neity of neoplastic populations, in situ methods (immu- 
nohistochemistry and ISH) allow a more definite map- 
ping of the distribution of the receptor in such tissues. 
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This is potentially useful for predicting the responsive- 
ness of a given neoplastic cell population to medical 
treatment with somatostatin analogues, which are used in 
the clinical setting for both diagnostic and therapeutic pur- 
poses with special reference to neuroendocrine tumors. 

The spectrum of neuroendocrine tumors of the lung 
includes well-differentiated neoplasms (so-called typical 
. carcinoids) and poorly differentiated small- cell carcino- 
mas (SCCs). Intermediate forms sharing features of both 
the aforementioned types also belong to this spectrum 

. (so-called atypical carcinoids or well-differentiated neu- 
roendocrine carcinomas). Finally, large-cell neuroendo- 

— Grine-oaTeinoma-has-been-identified and included-in-this- 
tumor group (4,40). The tissue distribution of sst2 in 
neuroendocrine tumors of the lung has not been thor- 
oughly characterized, although individual samples of 

: bronchial carcinoids were found to express sst (30). 
SCCs (but riot non-small-cell types) were also shown to 
be sst2 positive by receptor binding assay (33). More- 
over, sst2 has been detected in in vitro cell cultures of 

. human SSC of the lung (39,42). No study on a series of 
neuroendocrine tumors of the lung including all neuro- 
endocrine lung tumor types has been reported to date. 

The aim of this study was therefore to investigate the 
presence of ssl2 mRNA and protein in a series of 26 
neuroendocrine tumors of the lung, employing different 
technical approaches, such as RT-PCR, JSH, and immu- 
nohistochemistry. To this purpose a monoclonal anti- 
body to sst2 (N- terminal) was generated in the Depart- 
ment of Pathology, University of Turin. The results were 
then compared and related to the rumor grade and to 

; other clinicopathologic parameters. 

MATERIALS AND METHODS 

Case Series and RNA Extraction 

Twenty-six samples of neuroendocrine tumors of the 
lung, in which fresh frozen tissue was available, were 
retrieved from the surgical pathology file of the Univer- 
sity of Turin, Italy. All samples were reviewed applying 
currently accepted criteria of classification (4,40), and 
the neuroendocrine nature was confirmed by positive im- 
munostaining for chromogranin A (CgA) (with or with- 
out antigen retrieval) or synaptophysin, and by positive 
RT-PCR for CgA mRNA. According io the classi- 
fications described here, these included J 5 well- 
differentiated neuroendocrine tumors (typical carci- 
noids), 6 well-differentiated neuroendocrine carcinomas 
(atypical carcinoids), and 5 SCCs. 

A series of 20 non- small-cell lung carcinomas (10 
squamous, 9 adenocarcinomas, and ] large- cell anaplas- 
tic) lacking neuroendocrine differentiation, as demonstrated 
by negative iminunohistochernistry and RT-PCR for CgA 
(1), served as a comrol group." Clinicopathologic data and 
follow-up infonnation were obtained for all patients. 



For hybridization analysis, total RNA was extracted 
using the guanidine thiocyanate-cesium chloride method 
(5). The concentration of RNA was estimated by spec- 
trophotometry, and RNA degradation was assessed by 
agarose gel electrophoresis, as previously reported {37). 

Reverse Transcriptase Polymerase Chain Reaction 
for ssL2 and Cbromograriin A 

Total RNA (2 ixg) was first digested, with 10 units of 
RNase-free DNase (Boehringer, Mannheim, Germany) 
in a 10- pL solution containing 20 mmol/L MgCl 2 , to 
. avoid DNA contamination. The solution was kept at 
room temperature forTO mirjutesrtben heated for 5 min- 
utes at 70 D C to inactivate the DNase molecules; 40 
pmol/L of bligodeoxythymidine primers (oligo~dT16) 
were added and the solution was heated again at 70°C for 
10 minutes, then chilled on ice to allow the primer hy- 
bridization. The resulting solution was reverse tran- 
scribed using 100 units of reverse transcriptase (Gibco 
BRL, Gaitersburg, MD). Complementary DNA (cDNA) 
was generated in a 50-pJL final reaction volume contain- 
ing 50 mmol/L Tris-HCl pH 83, 75 mmol/L KC1, 3 
mmol/L MgCl 2 , 10 mmol/L dithiothreitol, 1 ntmol/L de- 
oxynucleotide triphosphates (dNTPs), and 20 units of 
RNasin (Promega, Madison, Wl). The solution was 
heated at 37°C for 90 minutes. Finally, the enzymes were 
inactivated by hearing to 70°C for 10 minutes. 

The efficiency of the reverse transcription was deter- 
mined by perfoniung a PCR reaction having the 3 2 - 
microgJobulin "housekeeping gene" as a target. PCR was 
carried out in a 10-jxL final reaction volume containing 
1 pL of cDNA template, 10 pmol of sense and antisense 
oligonucleotide primers, 67 mmol/L Tris-HQ pH 8.8, 16 
mmol/L (NH 4 )2S0 4 , 0.01% polysorbate 20, 2 mmol/L 
dNTPs, 1 mmol/L MgCl 2> and 0.5 units of Taq polymer- 
ase. 3 2 -Miaoglobulin, sst2, and CgA PCR reactions were 
performed using the same protocol at the foDowing PCR 
conditions: 35 cycles, each cycle consisting of denaturation 
at 94°C for 2 minutes, annealing at 55°C for 1 minute for 
IV microglobulin, at 61 D C for sst2, and at 68°C for CgA; 
extension was performed at 72°Cfor 3 minute. The primers 
used for RT-PCR (9, 1 1 ,2337) are reported in Table 1 . 

The amplified fragments were run in a 1% agarose gel, 
containing .ethidium bromide. Strict precautions against 
contamination were undertaken (19) and negative con- 
trols (a no-template control and a no^-reverse transcriptase 
control and distilled water to replace the RNA) -were in- 
cluded. The RNA extracted from an H716 neujoendocrine 
colon carcinoma cell line, and from a neuroblastoma (37) 
served as positive controls for CgA and sst2, respectively. 

Antibodies 

Two different antibodies specific for sst 2 were em- 
ployed: The first one was a monoclonal antibody raised 
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TABLE 1 



Sequences of primers used lor reverse transcriptase polymerase chain reaction 



Size ot PGR 
product (bp) 



Position 



Study 



^.microg.obuDn sense: 5' ACC : OCC, ACTGAA .AM 6MGA* 
,{ ^-microdlobuKn antisense: 5' ATC TTC AAA CCT CCA TGA it.. 3 
3 SSTR2 Jense: 5' CA6 TCA TGA 6CA ICG ACC GA 3\ 
• 4 f ItR2 denser 5' GCA AAG ACA GAT GAT GGJ GA 3* 
3 CoA sense- 5' GCT CCA AGA CCT CGC TCT CC 3' 
g gS anfcense: 5' 6AC CGA CJC TCG CCT TTC CG 3' 

; PCK polymerase chain reaction. 
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by both A and B receptor isoforms). The octapeptide 
EpyYDLTS, corresponding to amino acids 35 to 42 of 
the human receptor (and differing by one amino acid 

■ Horn the mouse sequence), was synthesized, having a 
TysiD added to the N-terminal. This sequence was simflar 
:totbat used by other groups to produce polyclonal and- 

■ bbdies (17,18,27). This sequence was rather sbort but 
made it possible to avoid extensive homology with sstl. 
In addition, according to a genbank search using FASTA 

• (28), this protein sequence is unique to buman sst2 and 

■ has a partial homology only with rat and human nuclear 

■ receptor retinoid orphan nuclear- receptor-beta (a piotein 
■" having nuclear localization). Three Balb/c mice were im- 

rnunized with the peptide conjugated to keyhole limpets 
hemocyanin (KLH) (Sigma, St. Louis, MO) following 

' the standard procedure. After the first intrasplenic injec- 
tion (100 ftg of protein) at time 0, the mice were jnrra- 
peritoneally injected six times with the peptide-KLH 
conjugate (150 ug) in the presence of Freund adjuvant. 
The reactivity of the sera from each animal was evalu- 
ated using an enzyme-linked immunosorbent assay, us- 
ing the peptide coated onto the plastic. The bybridomas 
were produced by somatic fusion of immunized spJeno- 
cytes with the mouse myeloma cell line Ag8.X63.653, 

.following the standard .technique (21). Tbe monoclonal 
antibodies of interest were selected on the basis of tbe 
reactivity with the target peptide and with appropriate 
tissue sections: The latter included formalin-fixed and 
paraffin-embedded sections of pituitary. gland and pan- 
cieanc islets and were analyzed by means of irrrmuno- 
peroxidase staining. ParaUeJ control experiments were 
also performed by staining serial sections of these tis- 
sues, omitting tbe primary antibody or with the preim- 
muDe serum or with the antibody preadsorbed with high 
concentrations (1 mg/mL) of tbe antigenic peptide. In 
addition, the selected monoclonal antibodies (coded 
10C6 and 10G4), both of IgM isotype, were further char- 
acterized by Western blotting. Membranes were prepared 
from stable transfected Chinese hamster ovary (CHO> 
K) cells, individually expressing recombinant human so- 
matostatin receptors (sstl to sst5). Western blotting was 
performed as previously described (36). The monoclonal 
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antibody was used as culture supernatant at 1:3 dilution 
_jwr 9 hours at room temperature, jn Tris-buffered saline 
(TBS), supplemented with 0.1% polysorbate 20'. Blots 
were washed in TTBS and incubated with peroxidase- 
conjugated goat antimouse IgM, diluted 1:1-500 for 90 
minutes at room temperature. Then, blots were washed in 
TTBS and immunocoraplexes were visualized using 
ECt followmg manufacturer's instructions (Amersbam, 

Bucks, UK). " " 

A second polyclonal antibody was produced that bad 
been characterized previously (35,36). This antibody 
(coded K230) was raised in sheep and was specific for a 
sequence of the C-terrninal portion of the sst2A^(KSRL- 
hTETTETQRTlJ-NEDLQ, amino acids 347 to 366). 

Ixjarriun obist ochemistry 

Sections 4 or 5 \i thick, adjacent to those used for 
conventional histopathologic examination and immuno- 
staming for neuroendocrine markers, were collected onto 
poly-L-lysine-coaied slides: The proliferative activity of 
the tumors was assessed by means of K567 imrounosta- 
nining (clone MEB1, Immunotecb, Marseille, France), 
diluted 1:10 after microwave-based antigen retrieval in 
citrate buffer). Tbe ascitic fluid of monoclonal antibody 
10G4 was used in this study and was applied to tissue 
sections with prior antigen retrieval (three 3-minute pas- 
sages in a.microwave oven at 800 V/ in citrate buffer pH 
6.0), at the dilution of 1:10,000 or 1:12,000 for 30 min- 
utes' at room temperature, tbe antiserum coded K230- 
was applied overnight at a dilution of 1 :300 with no prior 
antigen retrieval. The immune reactions were then re- 
vealed with the immunoper oxidase technique (13) using 
the streptavidin-peroxidase kit and diaminobenzidine as 
chroroogen. A weak nuclear coumerstain or no counter- 
stain was used in parallel sections. Control stainings for 
both antibodies included immunoperoxidase of .serial 
sections using preirnmune serum or antibody pread- 
sorbed with the antigen or buffer instead of the primary 
antibody. 

In Situ Hybridization 

Selected tumors (32 samples) were also analyzed fof 
sst2 mRNA expression by means of a nonradioactive, 
tyramide deposition-based 3SH technique. Tire proce- 
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dure of amplification was modified from procedures re- 
ported by Kerstens et al (16), Speel et al. (38),. and' tbe 
GenPoint (biotinyl-iynaniide) manufacturer (Dako, Glos- 
trup, Denmark). Briefly, 5-u.m-thick paraffin sections 
were collected onto silane-coated slides and deparaf- 
finized through xylene and graded alcohols to phosphate 
buffer saline (PBS). The slides were then incubated for 5 
minutes in a microwave oven at 800 W in citrate buffer 
pH 6.0. After washing in PBS, they were digested with 
proteinase K (1 u-g/mL) for 10 minutes at 23°C Endog- 
enous peroxidase activity was blocked with 3% hydro- 
gen peroxide and -endogenous biotin was blocked using 
~^drir-ijJcK:king-Te^enrtoT— 15 minutes-foHo wed-by— 



washing in PBS and biotin-blocking reagent for 15 min- 
utes (3)." Sections were then prebybfidi2ed for 1 hour at 
room temperature in a mixture composed of 4x SSC, 
50% forroamide, Denhardt's Ix, dextrane sulfate 5x, 500 
Hg/mL salmon sperm DNA, and 250 p-g/mL tRNA. Hy- 
bridization took place overnight at A2°C in a solution 
containing the specific probe at a concentration of 1 
pmol/mL The probe was a digoxigenin-labeled 48-base 
oligonucleotide (32), complementary to positions 91 to 
139 of the human sst2 gene (41). After hybridization,, 
excess hybridization buffer and coverslips were removed 
by a rapid wash in 4x SSC followed by stringent washing 
in O.lx SSC for 10 minutes at 42°C. The hybrids were 
revealed by the following incubation steps: peroxidase- 
labeled antidigoxigenin (diluted 1:100 in PBS) for 30 
minutes at room temperature, biotinylated tyramide (di- 
luted 1:5 in PBS) for 15 minutes at room temperature, 
and peroxidase-labeled streptavidin for 15 minutes at 
room temperature. Diaminobenzidkie was used as chro- 
mogen. Controls for 1SH included staining of serial sec- 
tions with sense probe, an unrelated probe (EBER-3'of 
the Epstein-Barr virus), and omission of the probe in the 
hybridization mixture, with all other experimental con- 
ditions identical to the procedure described here. 



RESULTS 

Keverse Transcriptase Polymerase Chain Reaction 

All neuroendocrine rumors, but no normeuroendocrine 
lung carcinomas, were positive for CgA mRNA (Fig. I). 
Sst2 mRNA was amplified in 22 of 26 samples of neu- 
roendocrine tumor. The signals had variable intensities 
(Eg. 2) and were weak in moderately or poorly differ- 
entiated tumors (mostly in SCCs). No amplification was 
obtained in no- tempi ate or no-reverse transcriptase ex- 
periments. Control samples (nonneuroendocrine lung 
carcinomas proven by negative CgA RT-PCR) were 
weakly positive for sst2 in 5 of 20 samples only (includ- 
ing 3 adenocarcinomas," J squamous, and the large-cell 
. anaplastic carcinoma) (Fig. 3). These, differences were 
statistically significant (P < 0.01 ) by X 2 test 

Characterization of Monoclonal Antibodies to sst2. 

Several clones were identified having a positive bind- 
ing by enzyme-linied immuDOSorbent assay and a par- 
allel iromun ore activity" on formalin-fixed paraffin- 
embedded human endocrine tissues (pituitary and pan- 
creatic islets). In Western blotting experiments, two 
clones (coded 10C6 and 10G4) specifically developed a 
band at approximately 70 klX When the antibodies were 
used against CHO-trans/ecied cells expressing recombi- 
nant somatostatin receptors 1 through 5, a specific band 
corresponding to sst2 (at approximately 70 kD) was re- 
vealed by the monoclonal antibody 10G4. Monoclonal 
antibody 10C6 developed a strong band with sst2 but 
displayed a weaker reactivity also with sstl, 3, and 5, at 
least in the present experimental conditions (Fig. 4 A,B). 
The same antibodies were also tested by means of im- 
rounoperoxidase staining, on formalin- fixed, paraffin- 
embedded samples of normal human pituitary gland and 
pancreas. Monoclonal antibody 1QG4 gave good results 
in immujiornstochemistry and was used at increasing di- 
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. FIG. 1. Reverse transcriptase polymerase chain reaction for chromograniri A (CGA) mRNA 
in 26 samples ot neuroendocrine tumor ol the lung. Numbers in each lane correspond to 
sample numbers in Table 2. CGA mRNA is amplified at 429 bp. C and.C* stand for positive 
(neuroendocrine colon carcinoma cell line, H716) and negative (distilled water) controls, 
respectively. The last column to the right represents the molecular weight marker. All samples 
are pcsilive with a variable intensity ot the amplification band. . 
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FIG. 2. Reverse transcriptase polymerase chain reaction for sst2 mRNA in 26 samples of 
"neuroendocrine tumor of the lung. Numbers in each lane correspond to sample numbers in 
Table 2. sst2 mRNA is amplified at 284 bp. C and C* stand for positive (a neuroblastoma) ■ 
and negative (distilled water) controls, respectively. The last column to the right represents 

. the molecular weight marker. Twenty-two of 26 samples are positive with a variable Intensity 
of the amplification band. 



lutions (up to 1 :15,000) with specific staining. Using thin 
sections (approximately 4 pjrj), a strong membrane- 
bound and peripheral cytoplasmic immunoreactivity was 
found in an adenohypophyseal cell population (corre- 
sponding to growth hoj-m one- secreting cells, as con- 
- firmed by double immunohistochemical analyses) and in 
pancreatic islets (Fig. 4 C.D). In the latter, the staining 
was apparently not restricted to a specific hormone- 
producing cell type and had a peripheral cytoplasmic or 
membrane distribution. Exocrine pancreatic cells (both 
acinar and ductal) were only occasionally immuno : 
stained. Immunobistochernistty performed on serial con- 
trol sections, either omining. the primary, antibody or us- 
ing the preimmune serum or antibodies preabs orbed with 
the synthetic peptide, was negative in both tissues. 
Monoclonal antibody J0C6 had a relatively higher back- 
ground staining at similar dilutions. 

Immunohistocbemistry " 

The antibodies tosst2 (monoclonal antibody 10G4 and 
polyclonal K230) gave slightly different imrpunoreac- 
tions in 25 samples, and staining was not done in 1 
sample because of lack of residual paraffin blocks. The 
monoclonal antibody 10G4 stained 21 of 25 samples, the 



negative samples being 1 atypical carcinoid and 3 SCCs 
(Fig. 5). The rumors had 5% to 25% of the neoplastic 
cells immunoreactive. The staining was al the periphery 
of the cytoplasm, and omitting the counterstain its mem- 
brane-bound distribution was better outlined in most 
samples (Fig. 6). One sample of atypical carcinoid {no. 
21) was focally immunoreactive for sst2, despite nega- 
tive RT-PCR findings. Conversely, sample no. 26 was 
immunohistochemistry negative and RT-PCR positive. 
The antiserum anti-sst2A (code K230) gave positive sig- 
nal in 19 of 25 samples, in 5% to 60% of the neoplastic 
cell population (Fig. 7). The location of the staining was 
at the membrane level associated with a weak cytoplas- 
mic reactivity. The same pattern was seen in positive 
controls, e.g., pancreatic islets (Fig. 7, inset). Two 
samples (nos. 19 and 26) were negative in spite of a 
positive RT-PCR signal. Two other tumors (nos. 21 and 
22), apparently devoid of sst2 mRNA, showed a small 
percentage of immunoreactive cells. Incidentally, one of 
these latter samples (no. 21) was also immunoreactive 
with monoclonal antibody 10G4 (Table 2). 

The five control samples positive *by RT-PCR were 
also reactive with the antibodies. The type of immuno- 
cytochemical location of sst2 receptors was similar to 
that described here, being a peripheral cytoplasmic stain- 




M 1 2 3 4 5 6 7 8 9 10 31 12 13 14 15 16 37 18 39 20 C C* M 
FIG. 3. Reverse transcriptase polymerase chain reaction for sst2 mRNA in 20 control 
samples of nonneuroendocrine lung carcinoma. Five ol 20 samples show a weak band at 
284 bp corresponding to sst2 mRNA. Control'columns (C and C*) are identical to those in 
Fig. 2. 



284 bp 
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FIG. 4. Western blot analysis of monoclonal antibody 
clones 10G4 and 10C6 against sst2 in Chinese hamster 
oyary cells transfected with recombinant sst 1 through 5 
(numbers ol each column correspond to receptor type). 
Monoclonal antibody 10G4 shows a specific band at ap- 
proximately 70 kD for sst2 only (A) as opposed to mono- 
clonal antibody 10C6, which strongly reacts with sst2 but 
also has some degrees of cross-reactivity with sst 1, 3 f 
and 5 (B). The lower figures show control formalin-fixed 
paraffin-embedded pancreatic islets immunostained with 
monoclonal antibody 10G4 without (C) and with (D) 
preadsorption with the peptide antigen, respectively. The 
majority of endocrine cells show a membrane-bound im- 
munoreactivity (C) (immunoperoxidase). Bar: 90 pm. 



ing present in 40% to 70% of neoplastic celts. A weak 
and focal staining was also observed in five of the re- 
maining RT-PCR-negative samples, when the antibody 
K230 was used (but not when the monoclonal was em- 
ployed). 

Several cells in peiiiumoral tissues were occasionally 
stained. Ciliated cells of bronchia) mucosa had a periph- 
eral staining at the cijia border. .Mucous glands were 
negative. Rare chondrocytes had a membrane staining. 
The wall of periiumoraJ as well as of occasional distant 
vessels was stained at the endothelium level and in oc- 
casional smooth muscle cells; 

. The reaciiviry / of both antibodies was abolished in se- 
rial sections when the reagents were preabsorbed with 
the respective synthetic peptides, but not when an unre- 
lated peptide was used. The peri tumoral bronchial mu- 
cosa had a focal staining of ciliated cells with both an- 
tibodies. This icactWity disappeared when the preab- 
sorbed antibody was applied. 



In Situ Hybridization 

Eight of 12 samples stained by ISH were positive for 
sst2 mRNA. The mRNA was present in a percentage of 
cells (ranging from 10% to 40%) and gave a weak signal 
(fig. 8), despite the amplification provided by the tyra- 
mide-based procedure. The background level was mini-, 
mal using diluted biotinylated tyramide. Control sections 
stained with sense probe or an unrelated probe, or omit- 
ting the probe, were consistently negative. 



Clinical Data 



Clinicopathologic data are summarized in Table 2. At 
follow-up, the majority of patients with typical carci- 
noids are free from disease 1 to 1 1 years after surgery. 
Two patients are alive with stable metastatic disease. 
Patients affected by atypical carcinoids had disease pro- 
gression in one third of samples. Finally, patients with SCC 
had fatal outcomes within 1 year from diagnosis (except the 
recent sample). Eight patients had preoperative octreotide 
scintigraphy performed at the time of diagnosis. All patients 
had positive octreoscan findings, 'and, in these patients, 
also the tumor was positive by RT-PCR and immu- 




FIG. 5. sample no. 25 (small cell carcinoma). Absence of 
immunoreactivity lor sst2 with the monoclonal This 
sample was also negative by reverse transcriptase poly- 
merase chain reaction and in situ hybridization. (Immuno- 
peroxidase in a formalin-tbted paraffin-embedded sample. 
NiKtei slightly counterstained with hemalum.) Bar: 45 pm. 
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nohistocbemistry or ISH. In addition, three of these pa- 
rjerrts received octreotide therapy administered at the 
tune of rumor recurrence or metastatic spread. Stable 
disease is recorded at follow-up more than 5 years after 
diagnosis. 

Correlations 

Overall, complete overlapping (i.e., RT-PCR, ISH, 
and rmmunobistoeheirjistry with two antibodies) be- 
tween sst2 gene and protein expression was obtained in 
21 of 25 samples (84%) and between RT-PCR results 

the antibodies in 24 of 25 samples (96%). The monoclo- 
nal antibody 10G4 looked highly sensitive, being able to 
stain all but one sample (do. 26) (95%) positive for sst2 
mRNA by RT-PCR. Sst2 expression, at mRNA as weD 
as at protein levels, was reduced in high grade tumors, 
with SCCs being weakly positive in ODly tvyo of five 
samples. Decreasing expression of sst2 appears to cor- 




FIG. 6. sample no. 16 (typical carcinoid). Immunohisto- 
chemical detection of sst2 by means of monoclonal anti- 
body 10G4. The neoplastic cells have a peripheral cyto- 
plasmic staining and membrane posilivity in some xells, 
whereas the peribronchial gland adjacent to lhe tumor is 
unreacfrve. (Imrnunoper oxidase in a formalin- fixed paraf- 
iin-embedded sample. Nuclei slightly counlerstained with 
hemalurn.) Bar: 45 pm. The membrane-bound distribution 
of the immunoslaining is better outlined in a parallel sec- 
tion stained for monoclonal antibody 10G4 omitting . 
nuclear counter stain (inset). 




FIG. 7. Same sample as in Fig. 6. Immunohistocbemical 
detection of sst2 by means of the polyclonal antibody 
K230. The immunostainrng is more intense at the cell bor- 
der (arrows), as observed with the monoclonal antibody. 
In lhe inset, a pancreatic islet, used as positive control, 
shws a predominant membrane-bound immunostaining of 
many neuroendocrine cells. (Immunoperoxidase in a for- 
malin-fixed paraiJtn-embedded sample. Nuclei slightly 
counlerstained with hemalum.) Bar 45 urn. 

relate with high tumor grade and elevated proliferative 
activity, but not with other parameters such sex, age, or 
tumor size. 

DISCUSSION 

In this study, the presence of sst2 mRNA has been 
demonstrated in a series of resected neuroendocrine tu- 
mors of the lung by.means of RT-PCR and confirmed by 
a sensitive nonradioactive tyramide-based ISH procedure 
and by immunohistochemistry with anti-$st2 antibodies. 
Samples of both carcinoid tumors and SCCs were sst2 
positive, although a reduced or absent signal was ob- 
served in poorly differentiated (srnallrcell) carcinomas. 
This is the first study of sst2 expression in a relatively 
larce series' of neuroendocrine tumors of the lung. Single 
samples of human carcinoids and SCCs (including cell 
lines of lhe Jailer) bad previously been analyzed and 
found to express sst2 (7,15:30,32,33,39,42). Several 
methods have been used to detect these receptors and 
partially overlapping results-were obtained. 

In the present study, die expression of high amounts of 
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TABLE 2, Clinicopathologic data and somatostatin receptor type 2 (sst2) expression in 26 cases of neuroendocrine 

- lung tumors 



Patient 






S(2G 


Follow-up 




ugA 


OVD 


rs.ro/ 


SSuf 


SSU: InO 


sst2 IHC 


no. 






w. 


(mo) 




RT.PTD 


Inu 




dt or* a 
HI-rUH 


(vlaD( 11A>4/ 


(IVdJO Ad) 


1 


WD NET . 


F/35 


3.5 


NED 90 






+ F 


1.5 


44-4 




* + 


2 


WD NET 


F/29 


4 1 


NED 45 


* 






0 


4+ 


+ 


+ 


3 


WD NET 


F/41 


2.5 


NED 70 




+4 




n -1 


44-4 


*+ 


• c 

+ r 


4 


WD NETf 


F25 


4 


MFD P1 


■ 


-f+ 


+ 




4-4-4 


+ r 


+ r 


* 5 


WD NETf 


IVJ/OO 


o.o 


Men o-t 

IMC l-' £0 


+ 


++4 


-f 


1J 


+44- 


4 r • 


+ 




U/n MPT 

VVU IMC 1 






Men ito 


+ 


+-+ 


+ 


• 4,5 


4+4 


4 r 


+ r 


7 


WD NET 






KlPO A1 


+ 


+-f 


+ 


■* 
I 


4 


*4 


+ 


$ 


WD NET 


M/29 


3 


IMCL/ /U 




+++ 






444- 


"4 


+ 


•g 




FVl/x / 


j« 
** 


NfcL> TUB 


+ • 


++^ 


+ 


NT 


4-f+ 


- 4 


4 


10 


WD NE Ca$ 


M/66 


8 


A WD 55 




+■+ 




1 1 




+ r 


4 


-.11.... 


WD NET± 


F/29 


2 


AWD56 


+ 


+++ 


- + 


23 


+44- 


4 


+ 


12 


WD NET* 


F/32 


' "3 


NED. 26 


+ 


++ 


+ 


A" " 


44-4- 


+ 


4 


13 


WD NE Ca 


M/60 


3 


NED 133 


+ 


4++ 


+ 


1 


444- 


NT 


NT 


14 


WD NET 


M/28 


4 


NED 130 


+ 


+++ . 


4 


1 


4-44- 


4 


+ 


15 


WD NET* 


M/41 


1;3 


A WD 53 


+ 


4-f+ 


+ 


1.5 


+4-4 


4 


+ 


16 


WD NETf 


F/31 


1 


NED 13 




++ 


4-f 


2.6 


4 


. 4 


4 


17 


WD NET 


F/53 


4 


NED 24 . 


+ 


+4 + 


+ F . 


4 


44+ 


4F 


4 


1B 


WD NE Ca 


M/62 . 


3 


NED 6 




4+ 




13 


4++ 


4 


+ F 


19 


WD NE CA 


F/73 


5 


DOD20 


+ . 


44-4 


+ 


3 


++ 


4 




20 


sec 


M/57 


6 


D0D12 


+ 


++ 


4 


45 


+ 


+ F 


4F 


21 


sec 


M/51 


4.5 


DOD5 


+ 


+ 


4 


35 




+ F 


+ F 


22 


WD NE Ca 


M/60 


6 


NED 51 


+ 


4 + 


4- 


1.5 






+ 


23 


sec 


F/56 


6 


D0D11 


+ 


4 


4 


50 








24 


WD NE Ca 


W77 


2.5 


NED 21 


+ 


+ 


4 


24 


+ 


+ F 


+ F 


25 


see 


M/57 


5 


DOD10 


+ 


4+- 


4+ 


80 








26 


sect 


M/68 


-11 


recent case 


+ 


4 


44- 


71 


+ 







AWD, alive with disease; CgA, chromogranin A; DOD, died of disease; -» F, focal: positive in <5% of cells; IHC; Immunohistochemislry; 
Mab, monoclonal antibody. NECa, Neuroendocrine carcinoma; NED, no evidence of disease; NET. neuroendocrine tumor, NT, not tested; 
RT-PCR. reverse transcriptase polymerase chain reaction; SCC» SmalKcell lung carcinoma; SYP, synaptophysin; WD, well difierenUated! 

' Ki67 IHC; values cpr/espond to percentage of positive nuclei of neoplastic cells. 

t Patients who had preoperative octreoscan performed. 

t Patients who had octreoscan performed and octreotide treatment. 



in metastatic deposits of SCC has been shown to be low 
or absent (29,2). The present findings suggest that the 
sst2 mRNA content is reJated to the degree of tumor 
differentiation. These data must be confirmed in larger 
series of noimeurbendocjrine rumors to ascertain whether 
the observed loss or decrease of sst2 expression in neu- . 
joendocrine tumors is a common, event Jinked to neo- 
plastic dediffejentianon. In addition, further studies are 
needed to assess the functionality of such receptors, by 
comparing the profile of sst2 expression in tumor tissues - 
with binding assays employing labeled somatostatin and 
with the clinical response to diagnostic and therapeutic 1 
administiatipn of somatostatin analogues. J 

To this purpose, several investigators have demon- 
strated a correlation between clinical imaging or re- j 
sppnse to somatostatin analogue treatment, and ssl2 j 
mRNA content in single . samples of carcinoid rumors I 
(15,22). Northern blotting and ISH were the techniques J 
used for sst2 mRNA identification. This kind of corre- J 
lation is useful for selecting patients for somatostatin j 
analogue treatment, although the demonstration of recep- . ! 
tor mRNA in a cell does not imply per se that the recep- * 
tor is fully functional. - j 

The. present study relied on a highly sensitive tech- ) 



sst2 mRNA was confirmed in well to moderately differ- 
entiated neuroendocrine tumors, in agreement with the 
results obtained by Reubi et a}. (32) by means of radio- 
active ISH. The presence of sst2 mRNA in SCC had 
never been reported in human specimens, except for two 
samples included in Reubi et al.'s series (32). Although 
the data on cell lines support the observation that'SCCs 
contain sst2 (42), slightly discrepant results were found 
in some of samples described here. Unfortunately, SCCs 
are rarely operated on, and therefore it is difficult to 
collect a large number of surgical specimens. The Jive 
samples studied in the current series by means of RT- 
PCR had a low amount (two samples) or absent (three 
samples) sst2 mRNA. This could be the result of the 
extensive necrosis commonly present in such tumor 
types. However, because care was taken to freeze frag- 
ments that were macroscopically devoid of necrotic ar- 
eas, a more likely hypothesis is that sst2 exprpssion is 
reduced in poorly differentiated rumors. Recently, Reis- 
inger et ah (29) showed that the uptake of somatostatin 
analogues in patients with SCC undergoing chemo- 
therapy is significantly lower, and therapeutic externa] 
factorTrnay affect the receptor status of individual ru- 
mors. In addition, the uptake of somatostatin analogues 
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-F1G. .8» sample -no.- -14- (typical carcinoid). In -sHtHhybfid- 
ization (ISH) for sst2 mRNA shows a weak cytoplasmatlc 
staining (A) in most tumor cells.. An ISH performed with an 
unrelated probe was negative in a serial section of the 
same tumor (B). This sample was strongly positive by 
reverse transcriptase polymerase chain reaction for sst2 
mRNA and by immunohistochemistry. (Nonradioactive 
ISH revealed by peroxidase and diaminobenzidine, as 
substrate. Nuclei counterstained with hemaJum.) Bar; 
75 pm. - 



: nique, RT-PCR, to identify all samples bearing even 
small amounts of sst2 mRNA. indeed, in a previous 
study, single samples exhibiting octreotide-binding sites 
" had no demon suable sst2 mRNA by means of ISH, pos- 
sibly due -to ibe low sensitivity of the ISH procedure (34). 
The RT-PCR bas shown sst2 mRNA transcripts in the 
majority of samples here studied. Only four samples 
were negative, all belonging to poorly differentiated high 
grade tumors, which usually follow an aggressive course. 
A decrease of ssi2 mRNA expression in association with 
neuroendocrine rumor dedifferentiation had also been re- 
ported in neuroblastomas (37). In the above report, as 
well as in the current study, samples having an unfavor- 
able piognosis were found to contain a relatively low . 
amount of sst2 mRNA, as compared with well- 
differentiated rumors. 

In the current sample series, eight samples were in- 
vestigated before surgery with radiolabeled octreotide. 
Despite the low figures, all the samples positive at the 
diagnostic procedure had a strong RT-PCR signal for 
sst2 mRNA. Three of eight patients were also responsi ve 
to octreotide treatment administered at the time of re- 
lapse or met as I a tic spread. More extensive correlative 
clmicopathologic smdies on the sst status are needed to 
better define the tissue distribution, of somatostatin bind- 
ing sites and their potential cjinica] role in the treatment 
of patients. . 

Sst2 evaluation. by means of ISH (14,32) or RT-PCR 
(26,37) is a highly sensitive and reliable procedure. Un- 
fortunately, ihese techniques have limitations because 
frozen tissue is needed for some of them; and radioactive — 
material or costly and time-consuming methods are nec- 



essary for others. InimrjmohistodieraicaJ analysis of sst2 
by means of specific antibodies represents an ideal , 
cheap, and rapid alternative, easily applicable to archival 
material. For these reasons, several investigators have 
raised polyclonal antibodies specific for sst (8,10,15,17, 
18). Li the current study, tested tuinor fragments adjacent 
to those snap frozen for RT-PCR analysis were tested 
with a polyclonal antibody against a C-terminal portion 
of the sst2A splice variant (35,36), In addition,. a mono- 
clonal antibody was produced in the Department of Pa- 
thology (University of Turin) against an N-tenninal se- 
quence of the human sst2. This antibody was the first 
monoclo n al devel oped against sst2 and was. .shown to be 



highly specific for sst2 in Western blot and immunobis- 
tocbemical analysis. Both the monoclonal and the poly- 
clonal antibodies specifically reacted with all. samples 
also positive by RT-PCR (with minor discrepancies in 
two samples, likely due to rumor heterogeneity). The 
observed correlation between RT-PCR and irnxnunohis- • 
tocbemistry indicates that the latter may be a reliable 
diagnostic tool and may allow immunom^tcwrbermcal in- 
vestigation for sst2 even in small biopsy samples. This in 
rum may enable a rapid screening of sst2-positjve tumors 
for medical treatment with somatostatin analogues. 

Having confirmed in a relatively large series that the 
vast majority of neuroendocrine tumors of the lung con- 
tain variable amounts of sst2 mRNA, a final comment is 
deserved for sst2 expression in nonneuxoendcKrrine lung 
carcinomas. No data have been reported thus far in the 
literature concerning normal human lung, although in the 
present study some bronchial cells of peritumoral paren- 
chyma were positive for sst2 when iiamunohistocbemi- 
cal analysis was performed with either antibody. The 
staining was specific because it was abolished using pre- 
absorbed antibodies. Therefore,, it is likely that normal 
human Jung tissue contains sst2. This might be con- 
firmed by alternative techniques (e.g.,* Western blot, RT- 
PCR). However, in situ morphologic procedures, such as 
those employed here, have definite advantages. In fact, 
the lung is rich in vessels, and in several tissues (either in - 
tumoral or in mflammatory-reactive conditions) the ves- 
sels were recently shown to contain sst (6). 

A low expression of sst2 was found in 25% of lung 
carcinomas of nonneurbendocrine type investigated in 
the present study by means of RT-PCR. Thej-eforc, sst 
type 2, at least, does not appear to be extensively -ex- 
pressed in nonneuroendocrine carcinomas of the lung. 
However, because two tumors in the control group (a 
squamous carcinoma and an adenocarcinoma, respec- 
tively) had positive octreotide scintigraphy y but no sst2 
mRNA, it is plausible that a heterogeneous distribution 
of sst occurs in nonneuroendocrine lung rumors. Other 
receptor types may be expressed in these tumors and may 
be responsible for the positive results in diagnostic test- 
ing. Because sst5. is also known to bind somatostatin 
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analogues, such as octreotide, with high affinity (24), the 
expression of this receptor type will be investigated in 
future studies. , □ 
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P-cadherin overexpression is an indicator of clinical outcome in invasive 
breast carcinomas and is associated with CDH3 promoter 
liypomethylation. 

JParedesJ, Albergaria A. Oltveii a JT . Jcronimo C . Milanezi F . Schmitt FC . 

Institute of Pathology and Molecular Immunology of Porto University (IP ATIMUP), 
Braga, Portiigal.jparedes@ipatimup.pt 

PURPOSE; P-cadherin overexpression has been reported in breast carcinomas, where it 
was associated with proliferative high-grade histological tumors. This study aimed to 
analyze P-cadherin expression in invasive breast cancer and to correlate it with tumor 
markers, pathologic features, and patient survival. Another purpose was to evaluate the P- 
cadherin promoter methylation pattern as the molecular mechanism underlying this gene 
regulation. EXPERIMENTAL DESIGN; Using a series of invasive breast carcinomas, P- 
cadhenn expression was evaluated and correlated with histologic grade, estrogen 
receptor, MIB-1, and p53 and c-erbB-2 expression. In order to assess whether P-cadherin 
expression was associated with changes in CDH3 promoter methylation, we studied the 
methylation status of a gene S'-flanking region in these same carcinomas. This analysis 
was also done for normal tissue and for a breast cancer cell line treated with a 
demethylating agent. RESULTS: P-cadherin expression showed a strong correlation with 
high histologic grade, increased proliferation, c-erbB-2 and p53 expression, lack of 
estrogen receptor, and poor patient survival. This overexpression can be regulated by 
gene promoter methylation because the 5-Aza-2'-deoxycytidine treatment of MCF-7/AZ 
cells increased P-cadherin mRNA and protein levels. Additionally, we found that 71% of 
P-cadherin-negative cases showed promoter methylation, whereas 65% of positive ones 
were unmethylated (P = (X005). The normal P-cadherin-negative breast epithelial cells 
showed consistent CDH3 promoter methylation; CONCLUSIONS: P-cadherin expression 
was strongly associated with tumor aggressiveness, being a good indicator of clinical 
outcome. Moreover, the aberrant expression of P-cadherin in breast cancer might be 
regulated by gene promoter hypomethylattom 
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Mammary-derived growth inhibitor protein and messenger ribonucleic 
acid concentrations in different physiological states of the gland. 

Politis h Gorcwtt RC MulJer 1\ Grossc R . 

Department of Animal Science, Cornell University, Ithaca 14853. 

Expression of mammary-derived growth inhibitor in tissue from lactating and involuting 
bovine mammary glands was investigated. Seventeen lactating, pregnant (220 to 272 d in 
gestation) cows were divided in two groups of 8 and 9 cows each. Cows of the first group 
were slaughtered while in lactation. Cows of the second group (9 involuting cows) were 
slaughtered at 13 to 52 d following sudden cessation of milking. High concentrations of 
mammary-derived growth inhibitor (.63% of the total protein) were detected in mammary 
tissue of lactating cows. Mammary-derived growth inhibitor (less than .10% of the total 
protein) was dramatically reduced during most of the involution period (13 to 45 d 
following cessation of milking). Mammary-derived growth inhibitor was again detected 
(.28% of the total protein) during the last stage of the involution (46 to 53 d after 
cessation of milking), which coincided with colostrum formation. When steady state 
concentrations of mammary-derived growth inhibitor mRNA were examined, the results 
obtained mirrored those obtained at the protein concentration. These data suggest that 
regulation of mammary-derived growth inhibitor occurs via modulation of the steady 
state concentration of its mRNA. Furthermore, there is a strong correlation between 
mammary-derived growth inhibitor expression and lactation in dairy cows. 
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T-cell receptor V beta-family usage in primary cutaneous and primary 
nodal T-cell non-Hodgkin's lymphomas. 

Prcesmati AH , Hu HZ, Tilanus MG, deGeusB Schunrman HJ . Reitsma R. van 
Wictieu DK van Vloten WA . de Weger RA ~ ~ ~ — 

Department of Pathology, University Hospital Utrecht, The Netherlands. 

To evaluate whether the expression of T-cell receptor (TCR) V beta families in eight 
cases of mahgnant T-cell lymphomas took place in a preferential manner, we analyzed 
four cases of mycosis fiingoides (MF), the most common form of primary cutaneous T- 
cell non-Hodgkin's lymphomas (NHL), and four cases of primary nodal T-cell NHL. The 
usage of V beta families in T-cell populations was investigated on mRNA that was 
transcribed to cDNA using a C beta primer and reverse transcriptase. Subsequently, the 
specific usage of the families was analyzed by polymerase chain reaction (PCR) using 
combinations of the selected C beta-oligonucleotide primer and one of the family-specific 
V beta primers. Peripheral blood lymphocytes from four healthy volunteers and 1 
"reactive" lymph node served as a control and expressed all 20 V beta families tested for. 
In T-cell lines, with restricted V beta expression, and in three patients with advanced MF, . 
only one or two V beta families were expressed at the mRNA level. In an early MF lesion 
this monoclonal expression was absent: several V beta families were expressed with a 
weak intensity. This may indicate either a polyclonal origin of MF, or that too few 
monoclonal neoplastic cells were present in the tissue specimen. In the four nodal T-cell 
NHL, only one family could be clearly distinguished, whereas some of the other V beta 
families showed only a weak expression. These latter families represent the reactive T- 
cell component in the nodal T-cell NHL. Both in nodal T-cell NHL and in MF there was 
no preferential expression of a particular V beta family. There was a good correlation 
between PCR data and the expression of V beta-family protein products observed by 
lmmunohistochemisuy on tissue sections of the T-cell lymphomas. All T-cell lines three 
cases of MF, and three cases of nodal T-cell NHL showed a rearrangement of the TCR 
beta chain on DN A level. 
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Matrilin-3 in human articular cartilage: increased expression in 
osteoarthritis. 

P"l»g O, WeselohG, Kiatt.AR . Wagener R . Swoboda B . 

Division of Orthopaedic Rheumatology, Department of Orthopaedics, University of 
Erlangen-Nuremberg, Rathsberger Str. 57, D-91054 Erlangen, Germany. 
oliver.pulltg@med.uni-erlangen.de 

OBJECTIVE: Matrilin-3 is a member of the recently described matrilin family of 
extracellular matrix proteins containing von Willebrand factor A-like domains. The 
matrilin-3 subunit can form homo-tetramers as well as hetero-oligomers together with 
subunits of matrilin- 1 (cartilage matrix protein). It has a restricted tissue distribution and 
is i strongly expressed in growing skeletal tissues. Detailed information on expression and 
distribution of extracellular matrix proteins is important to understand cartilage function 
. in health and in disease like osteoarthritis (OA). METHODS: Normal and osteoarthritic 
cartilage were systematically analysed for matrilin-3 expression, using 
inununohistpchemistry, Western blot analysis, in situ hybridization, and quantitative 
PCR. RESULTS: Our results indicate that matrilin-3 is a mandatory component of 
mature articular cartilage with its expression being restricted to chondrocytes from the 
tangential zone and the upper middle cartilage zone. Osteoarthritic cartilage samples with 
only moderate morphological osteoarthritic degenerations have elevated levels of 
matrilin-3 mRNA. In parallel, we found an increased deposition of matrilin-3 protein in 
the cartilage matrix. Matriiin-3 staining was diffusely distributed in the cartilage matrix, 
with no cellular staining being detectable. In cartilage samples with minor osteoartliritic 
lesions, matrilin-3 deposition was restricted to the middle zone and to the upper deep 
zone. A strong correlation was found between enhanced matrilin-3 gene and protein 
expression and the extent of tissue damage. Sections with severe osteoartliritic 
degeneration showed the highest amount of matrilin-3 mRNA, strong signals in in situ 
hybridization, and prominent protein deposition in the middle and deep cartilage zone. 
CONCLUSION: We conclude that matrilin-3 is an integral component of human articular 
cartilage matrix and that the enhanced expression of matrilin-3 in OA may be a cellular 
response to the modified microenvironment in the disease. Copyright 2002 OsteoArthritis 
Research Society International. 
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Up-regulation of mitochondrial peripheral benzodiazepine receptor 
expression by tumor necrosis factor alpha in testicular leydig cells. 
Possible involvement in cell survival. 

Rcy C , M audmt C , Naureils Q , Benahtncd M , Louisot P , Gasnier F . 

INSERM U. 189, Faculte de Medecine Lyon-Sud, BPI2, 69921 cedex, Oullins, France. 

Porcine Leydig cells in primary cultures are resistant to tumor necrosis factor alpha 
(TNFalpha) cytotoxicity. Here we report that these cells can be rendered sensitive to 
TNFalpha killing by treatment with the translational inhibitor cycloheximide, suggesting 
the existence of proteins that can suppress the death stimulus induced by the cytokine. In 
search of these cytoprotective proteins, we focused on the constituents of the 
mitochondrial permeability transition pore (PT pore), whose opening has been shown to 
play a critical role in the TNFalpha-mediated death pathway. We found that TNFalpha 
up-regulated mRNA and protein expression of the mitochondrial peripheral 
benzodiazepine receptor (PBR), an outer membrane-derived constituent of the pore. A 
strong correlation was established between the resistance of the cells to TNFalpha killing 
and the density of PBR-binding sites. Concomitantly, TNFalpha down-regulated Bcl-2 
mRNA and protein expression. As Bet -2 has been shown to be an endogenous inhibitor 
of the PT pore, we hypothesize that the TNFalpha-induced up-regulation of PBR 
expression may compensate for the decrease in Bcl-2 levels to prevent the opening of the 
PT pore. 
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GLUT1 messenger RNA and protein induction relates to the malignant 
transformation of cervical cancer. 

Radlowski C Becker AJ; Schroder W . RatUW, Buttner R , IVtoser M. 

Dept of Gynecology and Obstetrics, University Hospital Heidelberg, Vossstr 7-9, D- 
691 15 Heidelberg, Germany. 

We studied whether induction of glucose transporters (GLUTs) I to 4 correlates with 
human papillomavirus (HPV)-dependent malignant transformation of cervical epithelium. 
Tissue samples of cervical intraepithelial neoplasia (CIN; grades I to 3), invasive 
carcinomas, and lymph node metastasis were examined. HPV typing was performed. 
Tissue sections were immunostained with GLUT1 to GLUT4 antibodies. Messenger 
RNA (mRNA) in situ hybridization confirmed GLUT1 protein expression. Weak 
expression of GLUT1 was found in nondysplastic HPV-positiveand HPV-negative 
epithelium; significant expression was observed in preneoplastic lesions, correlating with 
the degree of dysplasia. In CIN 3 high-risk HPV lesions, cervical cancer, and metastasis, 
GLUT ! was expressed at highest levels with a strong correlation of GLUT 1 mRNA and' 
protein expression. Immunostains for GLUT2 to GLUT4 were negative. Cervical tumor 
cells respond to enhanced glucose utilization by up-regulation of GLUT1 . The strong 
induction of GLUT1 mRNA and protein in HPV-positive CIN 3 lesions suggests GLUT1 
overexpression as an early event in cervical neoplasia. GLUT1 is potentially relevant as a 
diagnostic tool and glucose metabolism as a therapeutic target in cervical cancer. 
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Expression and distribution of laininin alphal and alpha2 chains in 
embryonic and adult mouse tissues: an immunochemical approach. 

Max-PIanck-Institute for Biochemistry, Martinsried, D-82 1 52, Germany. 

Protein levels, mRNA expression, and localization of laminin alphal and alpha2 chains in 
development and in adult mice were examined. Recombinant fragments were used to 
obtain high-titer-specific polyclonal antibodies for establishing quantitative 
radioimmuno-inhibition assays. This often demonstrated an abundance of alpha2 chain, 
but also distinct amounts of alpha I chain for adult tissues. The highest amounts of alphal 
were found in placenta, kidney, testis, and liver and exceeded those of alpha2. All other 
tissue extracts showed a higher content of alpha2, which was particularly high in heart 
and muscle when compared to alphal. Content of gammal chain, shared by most 
laminins, was also analyzed. This demonstrated gammal chain levels being equal to or 
moderately exceeding the sum of alphal and alpha2 chains, indicating that these isoforms 
represent the major known laminin isoforms in most adult mouse tissues so far examined; 
Moreover, we found good correlation between radioimmuno-inhibition data and mRNA 
levels of adult tissues as measured by quantitative real-time reverse transcriptase-PCR. 
Embryonic tissues were also analyzed by radioimmuno-inhibition assays. This 
demonstrated for day 1 1 embryos comparable amounts of alphal and gammal and a 
more than 25-fold lower content of alpha2. This content increased to about 10% of 
alphal in day 13 embryos. The day 18 embryo showed in heart, kidney, and liver, but not 
yet in brain and lung, alphal/alpha2 chain ratios comparable to those in adult tissues. 
Immunostaining demonstrated alphal in Reichert's membrane (day 7.5), while alpha2 
could not be detected before day 11.5. These data were compared with 
irnmunohistochemical localization results on several more embryonic and adult tissue 
sections. Our results regarding localization are consistent with those of earlier work with 
some notable exceptions. This was in part due to epitope masking for monoclonal 
antibodies commonly used in previous studies in esophagus, intestine, stomach, liver, 
kidney, and spleen. 
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Discordant regulation of granzyme H and granzyme B expression in 
human lymphocytes. 

Sedeiies KA, Sayers TJ , Edwards KM . Chen W, PcUicci DG . Godfrey PI , Trapaai 
JA . 

Cancer Immunology Laboratory, Peter MacCallum Cancer Centre, Locked Bag 1 , 
A'Beckett Street, East Melbourne, 8006, Australia. 

We analyzed the expression of granzyme H in human blood leukocytes, using a novel 
monoclonal antibody raised against recombinant granzyme H. 33-kDa granzyme H was 
easily detected in un fractionated peripheral blood mononuclear cells, due to its high 
constitutive expression in CD3(-)CD56(+) natural killer (NK) cells, whereas granzyme B 
was less abundant The NK lymphoma cell lines, YT and Lopez, also expressed high 
granzyme H levels. Unstimulated CD4(+) and particularly CD8(+) T cells expressed far 
lower levels of granzyme H than NK cells, and various agents that classically induce T 
cell activation, proliferation, and enhanced granzyme B expression failed to induce 
granzyme H expression in T cells. Also, granzyme H was not detected inNK T cells, 
monocytes, or neutrophils. There was a good correlation between mRNA and protein 
expression in cells that synthesize both granzymes B and H, suggesting that gzmH gene 
transcription is regulated similarly to gzmB. Overall, our data indicate that although the 
gzmB and gzmH genes are tightly linked, expression of the proteins is quite discordant in 
T and NK cells. The finding that granzyme H is frequently more abundant than granzyme 
B in NK cells is consistent with a role for granzyme H in complementing the pro- 
apoptotic function of granzyme B in human NK cells. 
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BCL2 protein expression parallels its mRNA level in normal and 
malignant B cells. 

Sheu Y . Iflba[J, Huang JZ , Zhou G , Chan WC . 

Department of Pathology and Microbiology, University of Nebraska Medical Center, 
Omaha, USA. 

The regulation of B-cell. lymphoma 2 (BCL2) protein expression in germinal center (GC) 
B ceils has been controversial. Previous reports have indicated posttranscriptional 
regulation plays a dominant role. However, a number of recent studies contradicted these 
reports. Using real-time polymerase chain reaction (PCR) and Standardized Reverse 
Transcriptase-PCR (StaRT-PCR), we measured the level of mRNA expression in GC 5 
mantle zone (MNZ), and marginal zone (MGZ) cells from laser capture microdissection. 
Both quantitative RT-PCR measurements of microdissected GC cells from tonsils 
showed that GC cells had low expression of BCL2 transcripts commensurate with the 
low protein expression level. These results are in agreement with microarray studies on 
fluorescence-activated cell sorter (FACS)-sorted cells and microdissected GC cells. We 
also examined BCL2 mRNA and protein expression on a series of 30 cases of diffuse 
large B-cell lymphoma (DLBCL) and found, in general, a good correlation. The results 
suggested that BCL2 protein expression is regulated at the transcriptional level in normal 
B cells and in the neoplastic cells in most B-cell lyrnphoproliferative disorders. 
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Quantitative determinations of the steady state transcript levels of 
hexokinase isozymes and glucose transporter isofomis in normal rat 
tissues and the malignant tumor cell line ABC130. 

Shin ohara Y , Yamamoto K , Inoo K. Yamazaki N . Terada H . 

Faculty of Pharmaceutical Sciences, University of Tokushima, Japan. 
yasuo@ph.tokushima-u.ac.jp 

The steady state transcript levels of the four hexokinase (HK) isozymes and four glucose 
transporter (GLUT) isoforms were determined quantitatively by Northern analysis of 
RNA samples from rat tissues using synthetic fragments of the RNAs encoding the HK 
isozymes and GLUT isoforms. Results showed that the levels of HK isozyme transcripts 
were low in rat tissues, the level of that most highly expressed, the type I isozyme (HKI) 
ui the brain being 0.025% of the total poly(A)+ RNA. A good correlation was found 
between the reported HK activities and the total amounts of transcripts encoding all HK . 
isozymes in various tissues, showing that the HK activities in tissues can be estimated 
from the total amount of transcripts encoding HK isozymes. The proposed associated 
expressions of HK isozymes and GLUT isoforms in particular tissues were confirmed at 
their transcript levels. The steady state transcript levels of type II HK and the type 1 
GLUT isoform in the malignant tumor cell line AHl 30 were also determined 
quantitatively. 
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UVA irradiation-induced activation of activator protein-1 is correlated 
with iuduced expression of AP-1 family members in the human 
kcratinocytc cell line HaCaT. 

Silvers AL , Bowden GT . 

Department of Radiation Oncology, Arizona Cancer Center, The University of Arizona 
Tucson 85724, USA. 

To determine whether the transcription factor activator protein- 1 (AP-1) could be 
modulated by ultraviolet A (UVA) exposure, we examined AP-1 DNA-binding activity 
and transactivation after exposure to UVA in the human immortalized keratinocyte cell 
line HaCaT. Maximal AP-1 transactivation was observed with 250 kJ/m2 UVA between 
3 and 4 h after irradiation. DNA binding of AP-1 to the target 12-0- 
tetradecanoylphorbol- 13 -acetate response element sequence was maximally induced 1-3 
h after irradiation. Both de novo transcription and translation contributed to the UVA- 
induced AP-1 DNA binding. c-Fos was implicated as a primary component of the AP-1 
DNA-binding complex. Other components of the complex included Fra-2, c-Jun, JunB 
and JunD. UVA irradiation induced protein expression of c-Fos, c-Jun, Fra-1 and Fra-2. 
Phosphorylated forms of these induced proteins were determined at specific time points. , 
A strong correlation existed between UVA-^induced AP-1 activity and accumulation of c- 
Fos, c-Jun and Fra-1 proteins. UVA irradiation also induced c-fos and c-jun mRNA 
expression and transcriptional activation of the c-fos gene promoter. These results 
demonstrate that UVA irradiation activates AP-l and that c-fos induction may play a 
critical role in the response of these human keratinocytes to UVA irradiation. 
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Rat kidney glutamyl aminopeptidase (aminopeptidase A): molecular 
identity and cellular localization. 

SongL, Ye M, Troyanovskava Wtik E. WOk S. Healy DP . 

Department of Pharmacology, Mount Sinai School of Medicine, City University of New 
York, New York 10029. 

Glutamyl aminopeptidase [aminopeptidase A (EAP), EC 3.4.11,7] is an ectoenzyme that 
selectively hydrolyzes acidic amino acid residues from the amino terminus of 
oligopeptides. EAP activity is highest within the kidney and small intestine. The murine 
pre-B cell BP-1/6C3 and the human kidney glycoprotein gpl60 differentiation antigens 
have been reported to have biochemical properties indistinguishable from EAP. It is not 
known, however, if rat kidney EAP is a homologue of these antigens or molecularly 
distinct. Using the reverse transcription-polymerase chain reaction method with 
oligonucleotide primers based on the BP-1/6C3 nucleotide sequence, we isolated a 450- 
bp partial cDNA from rat kidney poly(A)+ RNA. The partial cDNA encoded a predicted 
protein that was 92% and 86% identical to the murine BP-1/6C3 and human gpl60 
antigens, respectively; the amino acid sequence within the zinc-binding domain was 
completely conserved. Purification of EAP from rat kidney and microsequence analysis 
of a tryptic digest peptide fragment (1 8-mer) indicated that the fragment was highly 
similar to a region within the BP- 1/6C3 and gp 1 60 proteins. Northern blot hybridization 
and immunoblot analyses were also consistent with labeling of products the same size as 
reported for the BP-1/6C3 and gp 160 antigens. There was a good coaelation between the 
cellular distribution of E AP mJRNA and EAP immunoreactivity, with proximal tubules 
and glomerular mesangtal cells having the highest densities. These results indicate that 
rat kidney EAP is a species homologue of the murine BP- 1 /6C3 and human gp 1 60 
antigens. Furthermore, on the basis of its cellular localization, rat kidney EAP is likely to 
be involved in degradation of oligopeptides within the glomerulus and the glomerular 
filtrate. Since cells that express EAP also express receptors for angiotensin II, an 
intrarenal vasoactive hormone that is a substrate for EAP, these results further suggest 
that EAP may play a role in modulating the activity of intrarenal angiotensin IL ) 
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Tumor necrosis factor-alpha upregulates the prostaglandin E2 EP1 
receptor subtype and the cycIooxygcnasc-2 isoform in cultured amnion 
WISH cells. 

Spaziani EP , Benoit RR , Tsibris JC , Gould SF , O'Brien WF . 

University of South Florida Health Science Center, Department of Obstetrics & 
Gynecology, Tampa 33612, USA. espazian@comLmed.usf.edu 

Recent studies have demonstrated a strong correlation between infection and preterm 
labor. Preterm delivery is also associated with high levels of cytokines and prostaglandins 
in amniotic fluid. The purpose of this study was to investigate the effect of tumor necrosis 
factor-alpha (TNF-alpha) on the levels of cyclooxygenase, prostaglandin E2 production 
(PGE2), and expression of the PGE2 receptor subtype EP 1 in amnion WISH cell culture. 
Amnion WISH cell cultures were incubated in increasing concentrations of TNF-alpha 
(0-50 ng/ml). Changes in cyclooxygenase and EPl receptor proteins were evaluated by 
Western blot analysis. Changes in EPl mRNA were evaluated by Northern blot, and 
culture fluid concentrations of PGE2 were estimated by enzyme immunoassay (EtA). 
EPl protein (p<0.01), EPl mRNA (p<0.05), cyclooxygenase-2 (COX-2) protein 
(p<0.001), and PGE2 concentrations (p<0.0l) all increased with increasing 
concentrations of TNF-alpha. Changes in COX-1 protein were not observed following 
TNF-alpha-incubation. The results suggest that TNF-alpha may play a role in infection- 
induced preterm Labor by its pleiotropic ability to simultaneously stimulate COX-2 
activity, PGE2 concentrations, and PGE2 EP 1 receptor levels in human amnion. 
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Transcriptional activity of potent glucocorticoids: relevance of 
glucocorticoid receptor isoforms and drug metabolites. 

Spika t, Hammer S , Kleuser B , Korting HC Schafer-Kortinff M 

Institut for Pharmazie, Abteilung fur Pharmakoiogie und Toxikologie, Freie Universitat 
Berlin, Berlin, Germany. 

As compared to standard glucocorticoids (GC), prednicarbate (PC) is favorable in the 
treatment of eczema due to its high benefit/risk ratio. The remarkable anti-inflammatory 
effects of PC are in strong contrast to its reported low glucocorticoid receptor (GR) 
binding affinity, [n transfected COS-7 cells we related the transcriptional potencies of 
PC, its metabolites and conventional GC to their receptor binding properties. Moreover, 
the expression pattern of the human GR isoform hGRalpha and its mutual dominant 
negative inhibitor hGRbeta in skin cells have been investigated as well as the influence of 
hGRbeta on receptor binding and transactivation. hGRalpha mRNA and protein was 
largely overexpressed in skin cells. hGRbeta showed no influence on hGRalpha binding 
and transactivation. Concentration response curves indicated the greater transactivation 
potency of betamethasone 17-valerate followed by dexamethasone and prednisolone 17- 
ethylcarbonate. Native PC appeared almost as potent as dexamethasone. With both a 
strong correlation was observed between transactivation and GR binding Copyright 2003 
S. Karger AG, Basel 
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Specific inhibition of AQP1 water channels in isolated rat intrahepatic bile 
duct units by small interfering RNAs. 

Splinter PL , Masyuk AI , LaRusso NF . 

Center for Basic Research in Digestive Diseases, Division of Gastroenterology and 
Hepatology, Mayo Medical School, Clinic, and Foundation, Rochester Minnesota 55905 
USA. 

Cholangiocytes express water channels (i.e. aquaporins (AQPs)), proteins that are 
increasingly recognized as important in water transport by biliary epithelia. However, 
direct functional studies demonstrating AQP-mediated water transport in cholangiocytes • 
are limited, in part because of the lack of specific AQP inhibitors. To address this issue, 
we designed, synthesized, and utilized small interfering RNAs (siRNAs) selective for 
AQP1 and investigated their effectiveness in altering AQP I -mediated water transport in 
intrahepatic bile duct units (IBDUs) isolated from rat liver, twenty-four hours after 
trans fection of IBDUs with siRNAs targeting two different regions of the AQP 1 
transcript, both AQP I mRNA and protein expression were inhibited by 76.6-92.0 and 
57.9-79.4% respectively. siRNAs containing the same percent of base pairs as the 
AQP 1 -siRNAs but in random sequence (i.e. scrambled siRNAs) had no effect. 
Suppression of AQP I expression in cholangiocytes resulted in a decrease in water 
transport by IBDUs in response to both an inward osmotic gradient (200 mosm) or a 
secretory agonist (forskolin), the osmotic water permeability coefficient (P(f)) decreasing 
up to 58.8% and net water secretion (J(v)) decreasing up to 87%. A strong correlation 
between AQP I protein expression and water transport in IBDUs transfected with AQP1- 
siRNAs was consistent with the decrease in water transport by IBDUs resulting from 
AQP1 gene silencing by AQP 1 -siRNAs. This study is the first to demonstrated 
feasibility of utilizing siRNAs to specifically reduce the expression of AQPs in epithelial 
cells and provides direct evidence of the contribution of AQP 1 to water transport by 
biliary epithelia. 
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Type IV collagcnasc (M(r) 72,000) expression in human prostate: benign and 
malignant tissue, 

Stearns ME , Wang M 

Department of Pathology, Medical College of Pennsylvania, Philadelphia 19129. 

The expression of type IV collagenase (M(r) 72,000) has been examined in tissues from patients 
with benign prostatic hyperplasia (6 patients) and varying Gleason grades of malignant prostate 
cancer (1 8 patients). Immunoperoxidase labeling indicated that expression of the type IV 
collagenase was weak or nonexistent in benign tissue but consistently strong in the glandular and 
ductal epithelial cells of prostate tumors diagnosed at Gleason grades 1-8. In moderate to 
advanced cancer (i.e., Gleason grades 2 to 8), invasive tumor foci in the stromal tissue produced 
relatively modest amounts of type IV collagenase. The normal stromal tissue (i.e,, fibroblasts) 
uniformly failed to produce detectable levels of type IV collagenase in the 24 patients examined. 
Northern and quantitative slot blot hybridization assays demonstrated that collagenase type IV 
mRNA levels were low in benign tissue and high in malignant tumors. In contrast, the stromal 
cells did not express significant amounts of type IV collagenase mRNA. Eazyme-Iinked 
immunosorbent assays demonstrated that the amounts of type IV collagenase protein correlated 
directly with the mRNA levels in the tumor tissue. The studies suggest that type IV collagenase 
may be selectively overexpressed by malignant, preinvasive prostatic epithelial cells! 
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The decompensated detrusor III: impact of bladder outlet obstruction on 
sarcoplasmic endoplasmic reticulum protein and gene expression. 

Stein R , Gong C. Hutcheson JC Canning DA , Zderic SA . 

Division of Urology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 
USA. 

PURPOSE: Regulation of calcium ion homeostasis has a significant role in smooth 
muscle contractility. The sarcoplasmic endoplasmic reticulum, calcium, magnesium, 
adenosine triphosphatase (SERCA) is a regulatory ion pump that may have a role in the 
junctional outcome after outlet obstruction. We investigate what correlation if any existed 
between SERCA protein and gene expression, and the contractile properties in the same 
bladder. MATERIALS AND METHODS: Standardized partial bladder outlet 
obstructions were created in adult New Zealand white rabbits, which were divided into, 
control, sham operated and obstructed groups. Muscle strip studies subcategorized the 
obstructed group into compensated (force greater than 50% of control) and 
decompensated (force less than 50% of control). Microsomal membrane and total RNA 
fractions were prepared from the same bladder tissue. Membrane proteins were used for 
Western blot analysis using a SERCA specific monoclonal antibody, and total RNA was 
assessed with Northern blot analysis. RESULTS: The relative intensities of signals for 
the Western and Northern blots demonstrated a strong correlation between protein and 
gene expression. Furthermore there was a strong association between the loss of SERCA 
messenger RNA and protein expression and loss of bladder function. CONCLUSIONS: 
Bladder contractility after outlet obstruction is influenced in part by smooth muscle cell 
ability to maintain calcium homeostasis via SERCA. The loss of SERCA protein 
expression is mediated by down-regulation in gene expression in the same bladder. These 
data suggest that smooth muscle ion pump gene expression is in part mechanically 
(pressure work) regulated. 
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TNF-alpha and IL-8 are upregulated in the epidermis of normal human 
skin after UVB exposure: correlation with neutrophil accumulation and E- 
selectin expression. 

Strickland I . Rhodes LE . Flanagan BF, Fried man n PS . 

Department of Dermatology, University of Liverpool, United Kingdom. 

The in vivo response to ultraviolet B (UVB) radiation in skin is characterized by the 
accumulation of both mononuclear and polymorphonuclear cells within the dermis and an 
induction of vascular endothelial adhesion molecules. Epidermal production of cytokines 
(IL-8 and TNF-alpha) has been strongly implicated in the development of UVB-induced 
inflammation. In the current study, we examined the time course of IL-8 and TNF-alpha 
mRNA and protein expression in die epidermis over a 24-h period after in vivo UVB 
irradiation. Also, the induction of adhesion molecule expression and the accumulation of 
neutrophils widun the dermis were followed. We found constitutive expression of both 
cytokines (mRNA and protein) in the epidermis of unirradiated skin. IL-8 was rapidly 
upregulated after irradiation and mRNA and protein increased at 4 h, reaching a 
maximum between 8 and 24 h. TNF-alpha mRNA and protein was minimally increased 
by 8 h after UVB irradiation and reached a maximum by 24 h. No significant alteration in 
ICAM-l or VCAM-l expression was observed. E-selec,tin expression, which was absent 
from control samples, was increased from 4 h onward and also reached a maximum at 24 
h, coinciding with peak neutrophil accumulation. A strong correlation (r = 0,96) was 
found between number of E-selectin-positive vessels and numbers of infiltrating 
neutrophils at this time. Moreover, because E-selectin expression was increased before 
any apparent increase in TNF-alpha protein (4 h), TNF-alpha does not appear to be 
involved in the early induction of the adhesion molecule, but cytokines such as TNF- 
alpha and IL-8 may act subsequently to augment the inflammatory response. 
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Basic fibroblast growth factor expression is increased in human renal 
fibrogenesis and may mediate autocrine fibroblast proliferation. 

Strutz F, Zeisberg M , Hemmerlem B , Sattler B, Hummel K Becker V . MuHer GA . 

Department of Nephrology and Rheumatology, Georg-August-University Gottingen, 
Germany, fstrutz@gwdg.de 

BACKGROUND: Interstitial fibroblasts play a critical role in renal fibrogenesis, and 
autocrine proliferation of these cells may account for continuous matrix synthesis. Basic 
fibroblast growth factor (FGF-2) is mitogenic for most cells and exerts intracrine, 
autocrine, and paracrine effects on epithelial and mesenchymal cells. The aims of the 
present studies were to localize and quantitate the expression of FGF-2 in normal and 
pathologic human kidneys and to study the in vitro effects of FGF-2 on proliferation, 
. differentiation, and matrix production of isolated cortical kidney fibroblasts. METHODS: 
FGF-2 protein expression was localized by immunofluoresence double labelings in 
normal and fibrotic human kidneys. Subsequently, interstitial FGF-2 labeling was 
determined semi quantitatively in 8 normal kidneys and 39 kidneys with variable degrees 
of interstitial fibrosis and was correlated with the morphomerrically determined 
interstitial cortical volume. In addition, FGF-2 expression was quantitated by immunoblot 
analysis in three normal and six fibrotic kidneys. FGF-2 mRNA was localized by in situ 
, hybridizations. Seven primary cortical fibroblast lines were established, and expression of 
FGF-2 and FGF receptor- 1 (FGFR-1) were examined. The effects of FGF-2 on cell 
proliferation were determined by bromodeoxyuridine incorporation and cell counts, those 
on differentiation into myofibroblasts by staining for alpha-smooth muscle actin, and 
those on matrix synthesis by enzyme-linked immunosorbent assay for collagen type I and 
fibronectin. Finally, proliferative activity in vivo was evaluated by expression of MIB-1 
(Ki-67 antigen). RESULTS: In normal kidneys, FGF-2 expression was confined to 
glomerular, vascular, and a few tubular as well as interstitial fibrobJast-Hke cells. The 
expression of FGF-2 protein was increased in human kidneys, with tubulointerstitial 
scarring correlating with the degree of interstitial fibrosis (r = 0.84, P < 0.01 ). 
Immunoblot analyses confirmed a significant increase in FGF-2 protein expression in 
kidneys with interstitial scarring. In situ hybridization studies demonstrated low-level 
detection of FGF-2 mRNA in normal kidneys. However, FGF-2 mRNA expression was 
robustly up-regulated in interstitial and tubular cells in end-stage kidneys, indicating that 
these cells are the source of excess FGF-2 protein. Primary cortical fibroblasts express 
FGF-2 and FGFR-1 in vitro. FGF-2 induced a robust growth response in these cells that 
could be blocked specifically by a neutralizing FGF-2 antibody. Interestingly, the 
addition of the neutralizing antibody alone did reduce basal proliferation up to 31.5%. In 
addition, FGF-2 induced expression of alpha-smooth muscle actin up to 1.6-fold, but no 
significant effect was observed on the synthesis of collagen type 1 and fibronectin. 
Finally, staining for MIB-1 revealed a good correlation of interstitial FGF-2 positivity 



with interstitial and tubular proliferative activity (r = 0.71, P < 0.01 for interstitial 
proliferation, N = 30). CONCLUSIONS; Interstitial FGF-2 protein and mRNA 
expression correlate with interstitial scarring. FGF-2 is a strong mitogen for cortical 
kidney fibroblasts and may promote autocrine fibroblast growth. Expression of FGF-2 
correlates with interstitial and tubular proliferation in vivo. 
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Adiposity elevates plasma MCP-1 levels leading to the increased CD1 lb- 
positive monocytes in mice. 

Takahashi K. Mizuarai S . Arald H . Mashiko S , Ishihara A , Kanatani A , Itadani II , 
Kotani H i 

Banyu Tsukuba Research Institute in collaboration with Merck Research Laboratories, 
Tsukuba, Ibaraki 300-26 1 1 , Japan. 

Obesity is currently considered as an epidemic in the western world, and it represents a 
major risk factor for life-threatening diseases such as heart attack, stroke, diabetes, and 
cancer. Taking advantage of DNA microarray technology, we tried to identify the 
molecules explaining the relationship between obesity and vascular disorders, comparing 
.mRNA expression of about 12,000 genes in white adipose tissue between normal, high 
fat diet-induced obesity (DIO) and d-Trp34 neuropeptide Y-induced obesity in mice. 
Expression of monocyte chemoattractant protein- 1 (MCP-1) mRNA displayed a 7.2-fold 
increase in obese mice as compared with normal mice, leading to substantially elevated 
MCP-l protein levels in adipocytes. MCP-l levels in plasma were also increased in DIO 
mice, and a strong correlation between plasma MCP-1 levels and body weight was 
identified. We also showed that elevated MCP-1 protein levels in plasma increased the 
CD1 lb-positive monocyte/macrophage population in DIO mice. Furthermore, infusion of 
MCP-1 into lean mice increased the CD 1 lb-positive monocyte population without 
inducing changes in body weight. Given the importance of MCP-1 in activation of 
monocytes and subsequent atherosclerotic development, these results suggest a novel role 
of adiposity in the development of vascular disorders. 



PMID: 13129912 [PubMed - indexed for MEDLINE] 





168: Circulation. 2002 Jan 29; l05(4):490-6. 



Related Articles, 



Links 




Augmented expression of neuronal nitric oxide synthase in the atria 
parasympathetically decreases heart rate during acute myocardial 
infarction in rats. 

Takimoto Y , Aoyama T . Tanaka IC Kcyamura R , Yui Y , Sasayama S . 

Department of Cardiovascular Medicine, Graduate School of Medicine, Kyoto 
University, Kyoto, Japan. 

BACKGROUND: Nitric oxide (NO) synthesized within sinoatrial cells recently has been 
shown to participate in the autonomic control of heart rate. We hypothesized that NO in 
the neuronal cells in the heart was increased and parasympathetically regulated heart rate 
after myocardial infarction (MI). METHODS AND RESULTS: We examined heart rate 
dynamics and neuronal NO synthase (nNOS) expression and activities in the atria of rats 
.with MI 1, 3, 7, and 14 days after MI (n=7 to 22 for each group). Both the mRNA levels 
of nNOS in the atria determined by competitive reverse transcriptase-polymerase chain 
reaction and the protein levels determined by Western blotting were significantly 
increased compared with controls 1, 3, and 7 days after MI. nNOS activity in the atria 1 
day after infarction was also increased in MI rats. nNOS immunoreacu'vity was observed 
in nerve fibers in the atria. After infusion of a specific inhibitor of nNOS and iNOS, l-(2- 
trifluoromethylphenyl) imidazole (TRIM) (50 mg/kg TV), heart rate was significantly 
(P<0.01) increased in MI rats compared with controls I, 3, and 7 days after MI. The 
iNOS-specific inhibitor, 1400W (10 mg/kg SC), did not significantly affect the heart rate 
in rats with MI. The effect of TRIM was abolished by pretreatment.with L-arginine (25 
mgfag IV) or by parasympathetic blockade with atropine but not by propranolol There 
was a strong correlation (r=0.837, PO.0001) between the nNOS protein expression and 
heart rate change after TRIM infusion. CONCLUSIONS: These results indicate that . 
increased nNOS parasympathetically decreased heart rate via the production of NO in 
rats with acute ML 
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Differential upregulation of cellular adhesion molecules at the sites of 
oxidative stress in experimental acute pancreatitis. 

Teick G , Ducroc R . Scoazec JY . Pasquicr C . Feldmann G . Roze C . 

INSERM U 410, Universite Paris 7 Denis Diderot, 75870 Paris, France. 

BACKGROUND: Severe acute pancreatitis (AP)(2) is associated with exaggerated 
leukocyte adherence and activation. Endothelial cellular adhesion molecules (CAMs) can 
be induced by cytokines, but also directly by oxygen free radicals (OFRs), mediated by 
nuclear factor kappa-B (NF-kappa B). We investigated the behavior of inducible CAMs 
in relation to pancreatic oxidative stress. Our novel modification of cerium capture 
histochemistry (reaction of OFRs with cerium produces laser reflective Ce perhydroxide 
precipitates) combined with reflectance confocal laser scanning microscopy (CLSM) 
allows the histological codemonstration of in vivo OFR production and immunolabeled 
CAMs, or NF-kappa B. METHODS: Taurocholate AP was induced in rats; sham 
operated and normal animals served as controls. To achieve in situ, in vivo reaction of 
cerium with OFRs, animals were perfused with CeCl(3) solution at different time points 
(1, 2, 8, 24 h) and then sacrificed; E-selectin, P-selectin, ICAM-1, VCAM, and NF-kappa 
B p65 were labeled by immunofluorescence (IF) on frozen sections of cerium perfused 
pancreata. IF and Ce perhydroxide reflectance were simultaneously detected by CLSM. 
Pancreatic gene expression of the same CAMs was quantified by competitive RT-PCR 
(MfMIC internal control). RESULTS: Control pancreata showed negligible reflectance 
and minimal CAM expression. Early (1, 2 h) AP samples were characterized by intense, 
heterogeneous acinar OFR production, strong P-selectin, and increasing [CAM 
expression, with nuclear translocation of p65, histologically all colocalizing with the 
areas of acinar oxidative stress. Adherent polymorphonuclear leukocytes (PMNs) 
displayed weak OFR formation. Later (8, 24 h), a slowly declining P-selectin, but 
persisting ICAM-1 expression, was paralleled by widespread adherence of PMNs 
producing surprisingly large amounts of OFRs. VCAM and E-selectin showed a mild 
increase at 24 h. CAM gene activation was in good correlation with the protein 
expression; CONCLUSIONS: The early acinar oxidative stress is colocalized with NF- 
kappa B activation, preferential P-selectin, and ICAM upregulation in this AP model. 
Subsequently, adherent, activated PMNs become the major source of OFRs, thereby 
contributing to tissue damage. Copyright 2001 Academic Press. 
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Myotonic dystrophy: an unstable CTG repeat in a protein kinase gene. 
Timchenko L , Monckton DG . Caskey CT . 

Department of Molecular and Human Genetics, Baylor College of Medicine, Texas 
Medical Center, Houston 77030, USA. 

Myotonic dystrophy (DM) is caused by the amplification of CTG repeats in the y 
untranslated region of a gene encoding a protein homologous to serine/threonine protein 
kinases. In DM patients the CTG repeats are extremely unstable, varying in lengtii from 
patient to patient and generally increasing in length in successive generations. There is a 
strong correlation between the size of the repeats and the age of onset and severity of the 
disease. The molecular basis of the effect of the CTG expansion on the development of 
the DM phenotype continues to be investigated. The first working hypothesis of the 
molecular mechanism of DM was a reduction in steady-state myotonin-protein kinase 
(Mt-PK) mRNA and protein levels. However, altfiough the consensus finding is that the 
Mt PK mRNA and protein levels are decreased in DM patients, it is still not clear if this 
reduction leads directly to the DM phenotype. In this short review we discuss the 
molecular aspects of CTG instability and the expression of the myotonin-protein kinase 
gene in normal and DM populations. . .- 

Publication Types: 

• Review 

• Review, Tutorial 
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Induction of class 3 aldehyde dehydrogenase iu the mouse hepatoma cell 
line Hepa-1 by various chemicals. 

Torronen R , Koi kalatnen M Karcnlanwi SO 

Department of Physiology, University of Kuopio, Finland. 

The mouse hepatoma cell line Hepa-l was shown to express an aldehyde dehydrogenase 
(ALDH) isozyme which was inducible by TCDD and carcinogenic polycyclic aromatic 
hydrocarbons. The induced activity could be detected with benzaldehyde as substrate and 
NADP as cofactor (B/NADP ALDH). As compared with rat liver and hepatoma cell 
lines, the response was moderate (maximally 5-fold). There was an apparent correlation 
between this specific form of ALDH and aryl hydrocarbon hydroxylase (AHH) in the 
Hepa-l wild-type cell line-in terms of inducibility by several chemicals. However, the 
magnitude of the response was clearly smaller for ALDH than for AHH. Southern blot 
analysis showed that a homologous gene (class 3 ALDH) was present in the rat and 
mouse genome. The gene was also expressed in Hepa-1. and there was a good correlation 
between the increase of class 3 ALDH-specific mRNA and B/NADP ALDH enzyme 
activity after exposure of the Hepa-1 cells to TCDD. It is concluded that class 3 ALDH is 
inducible by certain chemicals in the mouse hepatoma cell line, although the respective 
enzyme is not inducible in mouse liver in vivo. 
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Relationship between cyclin Dl and p21(Wafl/Cipl) during differentiation 
ol human myeloid leukemia cell lines. 

UHmannova V , Stockbaucr P , Hradcova M , Soucek J , Haskovec C . 

Department of Molecular Genetics, Institute of Hematology and Blood Transfusion, U 
Nemocnice 1, 128 20 Prague 2, Czech Republic, ullman@ulikt.cz 

Expression of cell cycle-regulating genes was studied in human myeloid leukemia cell 
lines ML-1, ML-2 and ML-3 during induction of differentiation in vitro. 
Myelomonocytic differentiation was induced by phorbol ester (12-o-Tetradecanoyl- 
phorbol-13-acetate, TPA), tumor necrosis factor alpha (TNFalpha) or interferon gamma 
(INFgamma), or their combination. Differentiation (with the exception of TNFalpha 
alone) was accompanied by inhibition of DN A synthesis and cell cycle arrest. Inhibition 
of proliferation was associated with a decrease in the expression of cdc25A and cdc25B, 
cdk6 and Ki-67 genes, and with increased p21(Wafl/CipI) gene expression, as measured 
by comparative RT-PCR. Expression of the following genes was not changed after 
induction of differentiation: cyclin Al, cyclin D3, cyclin El and p27(Kipl). Surprisingly, 
cyclin Dl expression was upregulated after induction by TPA, TNFalpha with 
IFNgamma or BA. Cyclin D2 was upregulated only after induction by B A. The results of 
the expression of the tested genes obtained by comparative RT-PCR were confirmed by 
quantitative real-time (RQ) RT-PCR and Western blotting. Quantitative RT-PCR showed 
as much as a 288-fold increase of cyclin Dl specific mRNA after a 24h induction by 
TPA. The upregulation of cyclin Dl in differentiating cells seems to be compensated by 
the upregulation of p21(Wafl/Cipl).These results, besides others, point to a strong 
correlation between the expression of cyclin Dl and p21(Wafl/Cipl) on the one hand 
and differentiation on the other hand in human myeloid leukemic cells and reflect a rather 
complicated network regulating proliferation and differentiation of leukemic cells. 
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Intestinal carbamoyl phosphate synthase I in human and rat. Expression 
during development shows species differences and mosaic expression in 
duodenum of both species. 

Van Beers EH , Rings EH, l'osthuma G. Dingemanse MA. Taminiau JA . Hermans 
HS, Eincrliand AW . Bullcr HA , Dckker J . 

Pediatric Gastroenterology and Nutrition, Department Pediatrics, Emma Children's 
Hospital, Academic Medical Center, Amsterdam, The Netherlands. 

The clinical importance of carbamoyl phosphate synthase I (CPSI) relates to its capacity 
to metabolize ammonia, because CPSI deficiencies cause lethal serum ammonia levels: 
Although some metabolic parameters concerning liver and intestinal CPSI have been 
reported, the extent to which enterocytes contribute to ammonia conversion remains 
unclear without a detailed description of its developmental and spatial expression 
patterns. Therefore, we determined the patterns of enterocytic CPSI mRNA and protein 
expression in human and rat intestine during embryonic and postnatal development, using 
in situ hybridization and immunohistochemistry. CPSI protein appeared during human 
embryogenesis in liver at 31-35 e. d. (embryonic days) before intestine (59 e.d.), whereas 
in rat CPSI detection in intestine (at 16 e.d.) preceded liver (20 e.d.). During all stages of 
development there was a good correlation between the expression of CPSI protein and 
mRNA in the intestinal epithelium. Strikingly, duodenal enterocytes in both species 
exhibited mosaic CPSI protein expression despite uniform CPSI mRNA expression in the 
epithelium and the presence of functional mitochondria in all epithelial cells. Unlike rat, 
CPSI in human embryos was expressed in liver before intestine. Although CPSI was ' 
primarily regulated at the transcriptional level, CPSI protein appeared mosaic in the 
duodenum of both species, possibly due to post-transcriptional regulation. 
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Expression of deoxycytidine kinase iu Ieukaemic cells compared with solid 
tumour cell lines, liver metastases and normal liver. 

van tier Wilt CL, Kroep JR . Loves WJ . Rots MG . Van Groeningen CJ . Kaspers GJ 
reters GJ . 

Department of Medical Oncology, VU University Medical Center, Amsterdam, The 
Netherlands. 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several 
deoxyribonucleoside analogues that are widely employed as chemotherapeutic agents. 
Examples include cytosine arabinoside (Ara-C) and 2-chlorodeoxyadenosine(CdA) in 
tlie treatment of acute myeloid leukaemia (AML) and gemcitabine to treat solid rumours. 
In this study, expression of dCK mRNA was measured by a competitive template reverse 
transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different 
histological origin, 16 childhood and adult AML samples, 10 human liver samples and 1 1 
human liver metastases of colorectal cancer origin. The enzyme activity and protein 
expression levels of dCK in.the cell lines were closely related to the mRNA expression 
levels (r=0.75, P=0.026 and r=0.86, P=0.007). In AML samples, dCK mRNA expression 
ranged from LI 6 to 35.25 (xl0(-3)xdCK/beta-actin). In the cell line panel,' the range was 
2.97-56.9 (x 1 0(-3)xdCK/beta-actin) of dCK mRNA expression. The enzyme activity in 
liver metastases was correlated to dCK mRNA expression (r=0.497, P-0.05). In the liver 
. samples, these were not correlated. dCK mRNA expression showed only a 36-fold range 
in liver wfiile a 1 50-fold range was observed in the liver metastases. In addition, dCK 
activity and mean mRNA levels were 2.5-fold higher in the metastases than in the liver 
samples. Since dCK is associated with the sensitivity to deoxynucleoside analogues and 
because of the good correlation between the different dCK measurements in malignant 
cells and tumours, the CT-RT PCR assay will be useful in the selection of patients that 
can be treated with deoxycytidine analogues. 
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Modulation of the glutamatergic receptors (AMP A and NMD A) and of glutamate 
vesicular transporter 2 in the rat facial nucleus after axotomy. 

ELeoreJU Vassias I , Vidal PP , de Wacle C . 

LNRS (CNRS-Paris V), ESA 7060, Centre Universitaire des Saints-Peres, 45 rue des Saints- 
Peres, 75270 Paris Cedex 06, France. 

Facial nerve axotomy is a good model for studying neuronal plasticity and regeneration in the 
peripheral nervous system. We investigated in the rat the effect of axotomy on the different 
subunits of excitatory glutamatergic AMPA (GLuRl-4), NMDA (NR1, NR2A-D) receptors, post- 
synaptic density 95, vesicular glutamate transporter 2, beta catenin and cadherin. mRNA levels 
and/or protein production were analyzed 1,3, 8, 30 and 60 days after facial nerve axotomy by in 
situ hybridization and immunohistofluorescence. mRNAs coding for the GLuR2-4, NR1, NR2A, 
B, D subunits of glutamatergic receptors and for post-synaptic density 95, were less abundant after 
axotomy. The decrease began as early as 1 or 3 days after axotomy; the mRNAs levels were 
lowest 8 days post-lesion, and returned to normal or near normal 60 days after the lesion. The 
NR2C subuhit mRNAs were not detected in either lesioned or intact facial nuclei. 
Immunohistochemistry using specific antibodies against GLuR2-3 subunits and against NR1 
confirmed this down-regulation. There was also a large decrease in vesicular glutamate transporter 
2 immunostaining in the axotomized facial nuclei at early stages following facial nerve section. In 
contrast, no decrease of NR2A subunit and of post-synaptic density 95 could be detected at any 
time following the lesion, "beta Catenin and cadherin immunoreactivity pattern changed around the 
cel l body of facial motoneuron by day 3 after axotomy, and then, tends to recover at day post- 
lesion 60 days. Therefore, our results suggest a high correlation between restoration of 
nerve/muscle synaptic contact, synaptic structure and function in facial nuclei. To investigate the 
mechanisms involved in the change of expression of these proteins following axotomy, the facial 
nerve was perfused with tetrodotoxin for 8 days. The blockade of action potential significantly 
decreased GLuR2-3, NR1 andNR2A mRNAs in the ipsilateral facial nuclei. Thus, axotomy- 
mduced changes in mRNA abundance seemed to depend partly on disruption of activity; 
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Expression- of cytokines and growth factors in human 
gloiuerulonephritides. 

Waldhcrr EL Noronha IL , Niemir Z , Kruger C Stela H , Stumm G . 
Department of Pathology, University of Heidelberg, Germany. 

Numerous experimental studies point to the potential role of cytokines and growth factors 
in the pathogenesis of renal disease. However, from the various autocrine and paracrine 
mediators identified in vitro and in animal models, so far only a few have been 
demonstrated in selected human glomerulopathies. We examined two types of 
glomerulonephritis (GN): extracapillary GN with anti-neutrophil cytoplasmic 
autoantibodies (ANCA), an example of an acute form of GN, and mesangial IgA GN, 
usually a chronic form of GN, with immunocytochemistry, in situ hybridization and the 
polymerase chain reaction. Normal renal tissue from tumour nephrectomies served as a 
control. In ANCA-positive GN with active renal lesions (crescents, glomerular and 
vascular necrosis), infiltrating mononuclear cells in glomeruli and in the interstitiurn 
expressed interleukin (IL)-1 beta, tumour necrosis factor (TNF)-alpha, IL-2, interferon 
(IFN)-gamma, platelet-derived growth factor (PDGF) and transforming growth factor 
(TGF)-beta. Cytokine expression was also observed in activated resident cells, including ■. 
endothelial cells, capsular epithelial cells, smooth muscle cells of vessel walls, fibroblasts 
and some tubular epithelial cells. lit addition, we noted an increase in the cytokine and 
growth factor receptors TNF-R, IL-IR type II, IL-2R, IFN-garnrria R and PDGF beta-R! 
In contrast, in mesangial IgA-GN, IL-1 beta, TNF-alpha, IFN-gamma and IL-2 were 
usually absent in glomeruli, Mesangial expansion in this disorder was accompanied by an 
increased expression of PDGF, PDGF beta-R, TGF-beta and IL-6 in mesangial areas. In 
both conditions a good correlation was observed between cytokine expression at the 
mRNA (in situ hybridization) and protein level (immunocytochemistry).(ABSTRACT 
TRUNCATED AT 250 WORDS) 

Publication Types: 

• Review 

• Review, Tutorial 
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Malignant transformation of the human endometrium is associated with 
overexpression of lactoferrin messenger RNA and protein. 

Walmcr PIC Padin CJ , Wrona MA . Healy BE. Beutley RC Tsao MS , Kohler MF , 
McLachlan JA , Gray KB . 

Department of Obstetrics and Gynecology, Duke University Medical Center, Durham, 
North Carolina 27710. 

In the mouse uterus, lactoferrin is a major estrogen-inducible uterine secretory protein, 
and its expression correlates directly with the period of peak epithelial cell proliferation. 
In this study, we examine the expression of lactoferrin mRNA and protein in human 
endometrium, endometrial hyperplasias, and adenocarcinomas using 
immunohistochemistry, Western immunoblotting, and Northern and in situ RNA 
hybridization techniques. Our results reveal that lactoferrin is expressed in normal 
cycling endometrium by a restricted number of glandular epithelial cells located deep in 
the zona basalis. Two thirds (8 of 12) of the endometrial adenocarcinomas examined 
overexpress lactoferrin. This tumor-associated increase in lactoferrin expression includes 
an elevation in the mRNA and protein of individual cells and an increase in the number 
of cells expressing die protein. In comparison, only I of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also observe distinct 
cytoplasmic and nuclear immunostaining patterns under different fixation conditions in 
both normal and malignant epithelial cells, similar to those previously reported in the 
mouse reproductive tract Serial sections of malignant specimens show a good correlation 
between die localization of lactoferrin mRNA and protein in individual epithelial cells by 
in situ RNA hybridization and immunohistochemistry. Although the degree of lactoferrin 
expression in the adenocarcinomas did not correlate with die tumor stage, grade, or depth 
of invasion in these 12 patients, there was a striking inverse correlation between the 
presence of progesterone receptors and lactoferrin in all 8 lactoferrin-positi ve 
adenocarcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis which is not shed with menstruation and is 
frequently overexpressed by progesterone receptor-negative cells in endometrial 
adenocarcinomas. 
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Malignant Transformation of the Human Endometrium Is Associated with 
Overexpression of Lactoferrin Messenger RNA and Protein 

David K. Warmer, 5 Cheryl J. Padin, Mark A. Wrona, Bridget E. HeaJy, Rex C. BentJey, MiDg-Sound Tsao, 
Matthew F. KohJer, John A. McLachJan, and Karen D. Gray 

Department of Obstetrics and Gynecology. Duke University Medico] Center. Durham. North Carvlina 27770 ID. K. B. £. JL. M. F. JL); Department of Obstetrics anj 
Gynecology, University of Michigan. Ann Arbor. Michigan 48)09 /C J. P.): School of Medicine. University of Maryland, Baltimore. Moryland 2J20J \M. A. WJ; Department cf 
Pathology. Duie University Medical Center. Durham. Nonb Carolina 277)0 /Jc C BJ; Deportment of Pathology. Montreal General Hospitol Montreal, Quebec, Canada 
H3GIA4 /Af-X TJ; laboratory of Reproductive and Developmental Toxicology. National Institute of Environmental Health Sciences, Research Triangle Pari, ftonh Carolina 
27709 1 J. A. M.}; and Department of Obstetrics and Gynecology. Uniformed Services University of the Health Sciences, Bethesdc, Maryland 201)4-7799 IK. D. G.J 



ABSTRACT 

In the mouse uterus, lactoferrin is a major estrogen -inducible uterine 
secretory protein, and its expression correlates directly with ibe period of 
peak epithelial cell proliferation. In this study, we examine the expression 
of Jactoferrin mRNA and protein in human endometrium, endometrial 
hyperplasias, and adenocarcinomas using immuDohisiochemistry, West- 
ern immunobJotlinj, and NoHhem and in situ RNA hybridization tech- 
niques. Our results reveal that lactoferrin is expressed in normal cycling 
endometrium by a restricted number of glandular epithelial cells located 
deep in the zona basalis. Two thirds (8 of 12) of the endometrial adeno- 
carcinomas examined over express lactoferrin. This tumor- associated in* 
crease in lactoferrin expression includes an elevation in the mRNA and 
.protein of individual cells and an increase In the number of cells express- 
ing the protein. In comparison, only 1 of Ibe 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also 
observe distinct cytoplasmic and nuclear immunostaJning patterns under 
different fixation conditions in both normal and malignant epithelial cells, 
similar to those previously reported in the mouse reproductive tract- 
Serial sections of malignant specimens show a good correlation between 
the localization of lactoferrin mRNA and protein in individual epithelial 
cells by in situ RNA hybridization and immu nobis tocbeinislry. Although 
the degree of lactoferrin expression in the adenocarcinomas . did not 
correlate with the rumor stage, grade, or depth or inrasion in these 12 
patients, there was a striking inverse correlation between the presence of 
progesterone receptors and lactoferrin in all 8 lactoferrin- positive adeno- 
carcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis which b not shed with menstru- 
ation and is frequently over expressed by progesterone receptor-negative 
cells in endometrial adenocarcinomas. 



INTRODUCTION 

The uterus is a sex steroid- responsive organ thai plays a major role 
in women's heajlh. Hysterectomies were the most frequently per- 
formed major surgical procedures in a 20-year study interval 
(1965-2984; Ref. 1). Fifty -«gbt to 80% of these 12.5 million 
procedures were performed for estrogen-related disorders of prolifer- 
ation. Chronic unopposed estrogen exposure, roost commonly asso- 
ciated with type II ovulatory disorders, eventually Irads to the devel- 
opment of complex endometrial hyperplasia and adenocajcjooma- 
Since the sex steroids, estrogen and progesterone, act on their target 
tissues by regulating the expression of a wide variety of signaling 
molecules, identifying these regulatory facrors will provide critical 
information towards understanding normal reproduction and repro- 
ductive tract pathology. Our current knowledge of estrogen and pjo- 
gesterone action on the reproductive tract is based to a great extent on 
information collected from todeots (2). Although differences exist 
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between the reproductive physiology of rodents and humans, the 
mouse has been a useful model for studying sleroid hormone action in 
the human female reproductive trad (3, 4). One potential regulatory 
molecule shown to be regulated by estrogen in the mouse reproductive 
tract is lactoferrin. haaofcjiin is n basic glycoprotein with an extraoj. 
dinariJy high affinity for iron that was originally discovered in milk. 
This protein is expressed in a wide vajiety of tissues, most notably in 
poly roorphonu clear leukocytes and. most mammalian exocrine glan- 
dular secretions. In the mammary gland (5) and ibe female reproduc- 
tive tract of the mouse (6-8), laeiofemn is regulated by endocrine 
hormones. Prolactin stimulates lactoferrin synthesis in the breast; 
whereas in the uterus and vagina, the ovarian sex steroid, 170- 
estradiol, is tbe inducer. (6, 7, 9). To date, lactoferrin is one of the few 
genes thai have been identified in the Todent that are directly regulated 
by estradiol. The lactoferrin gene contains an ERE 2 that is important 
for. regulating its expression in vivo in the mouse reproductive tract 
Being linked to estradiol, the expression of lactoferrin by the uterine 
epithelium parallels the onset of DNA synthesis. Although sequencing 
information suggests that the human lactoferrin gene also contains a 
functional imperfect ERE in the 5'-flanJting promoter region (10, 11)^ 
there is very little data regarding Jacioferrin expression in the human 
female reproductive tract. 

The purpose of our study was to examine the expression of Jacto- 
ferrin in the human endometrium under normal and pathological 
conditions by immuDohistocbemistry, immunoblotling, and Northern 
and in situ RNA hybridization techniques- In addition, we looked for 
correlations between lactoferrin expression and several parameters, 
such as the stage of the menstrual cycle, the distribution of estrogen 
and progesterone receptors, HER-2fneu expression, markers of cell 
proliferation, and the Mslopathological grade and extent of royome- 
trial invasion in the adenocarcinomas. Our data demonstrates that 
lactoferrin is expressed in a very restricted Dumber of glands in the 
basal region of normal human endometrium and is markedly oveiex- 
pressed in a significant number of the uterine adencx^rcinornas by 
PR- negative cells. 



MATERIALS AND METHODS 

Tissue Preparation and Histological Evaluation. Surgical paihology 
Specimens were obtained fjorn DuJcc University Medical Ceaier (Durham, NQ 
and ibe DepailmcDt of Pathology al MooueaJ Genrial Hospital (Quebec. 
MoDiiesl, Canada). Cycling codcnictiiuxn was obtained fiom 22 women (age* 
31-49). and atrophic endometrium was obtained bom 7 postmenopausal 
women (ages 64-77). Hysreitcromies from cycling women were performed 
foi subscrosal leiomyomas (n = 6).. pelvic jelaxatioD (n — 8), pelvic pain 
(n = 4), peiiioneal endometriosis (;i = 2), and cancer of either the exocervi* 
(n — 3) or the ovary (n - J). In addition, 12 adenocarcinomas, 3 atypical 
complex' hyperplasias, 3 complex hyperplasias without atypia, and 4 simple 
hyperplasias were analyzed. Each human uterus w as bivaJvcd sbordy after 
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bysi erect omy, and endometrium was rt moved from tbc fundal region. A full 
thic)o>css biopsy was placed into cither 10% neutral-buffered formalin or 
Bouin's solution overnight at room temperature before dehydration, paraffin 
embedding, and sectioning at 4 /nn on sitanized slides. Histological evalua- 
tions of hematoxylin and eosin-stained slides were performed blindly by one 
hoard-certified pathologist Normal endometrial samples were dated by the 
criteria of Noyes et oL (32). Endometrial hyperplasias and carcinomas were 
classified according to the current recommendations, of the International 
Society of Gynecological Pathologists under the auspices of WHO (13). 
Histological grading of tumors was performed according 10 Federation Inter- 
nationale des Gynccologistes el Obstetrisies criteria (14). Each specimen was 
read a minimum of three times, and only specimens that were read consistently 
the same way were included in tbc study. Unstained sections of the same 
tissues were used foi the cyiocbemical analysis of pjotcin and mRNA expres- 
sion using specific reagents. A few endometrial samples were frozen for 
subsequent protein and RNA extraction, which were evaluated by Western and 
Northern blotting, lespectively. All human tissues were handled with the 
precautions and the guidelines required by Duxe University and National 
Institute of Environmental Hearth Sciences. 

ImmunolocaKzatJon. Slides chosen foi study were deparaffinized and 
rinsed in 20% glacial acetic acid at 4"Cfor 15 s to inhibit endogenous alkaline 
phosphatase. AD subsequent incubations and washes were at room lempexa- 
hrre. Sections were next equilibrated in PBS for 20 min ami incubated for 20 
ruin with 15% normal goat serum dfluted in PBS to blocfc nonspecific binding. 
Detection of lactoferrin was performed primarily with a rabbit anti-human 
lactoferrin polyclonal antisera generated in our laboratory arid affinity purified. 
Similar results were also seen with a uonaffiniiy-pmified commercial antisera 
(Biogcnex, San Ramon, Ca). Following incubation at room temperatuje witb 
primary antisera for 60 min, the sections were washed in PBS twice for 30 min 
each, and lactoferrin was localized using an alkaline phosphatasc-biotin- 
Streptavidin detection system {Vectastain ABC-AP kit: Vector Laboratories, 
Burlingame, CA; or the Super Sensitive Detection System: Biogenex, San 
Ramon, CA). To identify nonspecific staining, preimmune rabbit serum was 
used in place of ibe primary antibody. The immun oread ion was quantitBtcd by 
determining the percentage of glands and ibe percentage of cells staining for 

- lactoferrin in the 2ona basal is and the zona funcrionalis, with b rninimuDi of . 
300 cells coumed in each region. PR antibody was provided by Geoffrey 
Greene (KD68), and a commercial source was also used (Biogenex, San 
Ramon, CA). Identical staining patterns were confirmed with boih prepara- 
tions. Other commercially obtained antisera include PCNA (BiogcDex, San 
Ramon, CA), ER (ERJD5; AMAC, Wesibrook, Ma), M1B-2 (AMAQ, and 
HER-Z'net* (Biogenex, San Ramon, CA). The primary antisera incubations 
were 2 h for the PR, 1 b for PCNA ; M1B-3, and ER, and 30 min for 
HER-2/neu. Aohsera dilutions were 3:)00 for M1BO and 1:20 for HER-27/wm. 
Antigen retrieval (Biogenex, San Ramon, CA) was performed before adding 
the progesterone primary antisera. 

Western Blot Analysis. Proteins were exnacied from endometrial biopsies 
by bomogenization on ice in 1% Tritbn-X and 20 dim Tris-HO (pH 7.4) with 
pole use mhibilors 00 >ig/ml leupcpiin. 200 KU/ml aprotinm, and 20 jxg/ml 
phenylinctbylsnjfonyl fluoride) and clarified by cenrrifugation at 45,000 rpm 
for 30 min in a BecXrnan 70.1 Ti rotor: then ihc supernatant was analyzed for 
protein concentration by ibe BCA protein assay (Pierce, Rockfoid, fL). Ali- 
quois of 200 fig were separated by electrophoresis on an 8J>% SDS polyac- 
rylamide gel, blotted onto nitroceDulose membranes, incubated with poly- 
clonal rabbit amihuman lacto/erjin amiscra, and localized with an 135 Mabelcd 
donkey an ri- rabbit immunoglobulin, e5 described previously (7). 

hi Situ Hybridization. All slides were pre treated with 02. N HQ for 30 
nun at room temperature, digested with \ ^pm\ proteinase-K (Sigma Chem- 
ical Co.. St. Louis. MO) b 0.1 M Ti is- HCl \pH 7.4VO.05 u EDTA for 15 min 
at 37"C, and then treated with 0.1 M iriethanol3Diiue-CL25% aceiic anhydride 
for 5 min at room t crop eraruic and 0 ] m Tris- glycine (pH 7.4) for 30 min at 
loooj temperature. The sections were subsequently dehydrated with graded 
. tihznol, air dried, and prebybridi2cd at M)"C for J b in 2 x SSC. 30 dim DTT, 

- X Denhajdt's solution, J 00 jje/ml oi boib salmon ipcixn DNA and yeast 
^RNA, and 50% fprmamide ( 15). The slides were iben hybridized overnight at 
50*C in tbc same medium with 10% do nan so If ate and 2*30" cpm/>i of ibe 
ipecifie RNA probe. The laciofcrrin oligonucleotide piobe was amplified by 
^CR using primers that spanned nucleotides 71 8- J 654 (10) and cloned into 
pGEM-42.. - 3 S-labelcd sense and antistnsr RNA piobes were »nade wjih the 



Promega Riboprobe kit (Promega, Madison, Wi), washed twice in 1 X SSC for 
J0 mb at room temperature, digested with RNase [2.8 jig/ml RNase-A, 03 
pg/ml RNasc-Tl, 10 m* Tris-Hd (pH 7.4), and 15 mM NaCI], and washed 
again with 1 X SSC twice for 20 min each time at 50*C, twice for 20 in 
OJ X SSC at 55*C and once for 20 min at 6(TC The sections were then 
dehydrated and dipped- m Kodak autoradiographic emulsion (NTB-2) for 
detection of specific mRNA expression. The slides were allowed to develop for 
2 weeks. After this period, the slides were developed using Kodak PJ9 
developer and Kodak Rapid Fixer. 

Northern RNA Analysis. Total cellular RNA was purified from frozen 
tissue by the guanidine isothiocyanate-cesium chloride method, and poly(A*> 
RNA was isolated by oligo(dT>cellulose chromatography using methods 
described previously (J5> For Northern blot analysis. poly(A + )-RNA 'was 
resolved by electrophoresis on 13% formaldehyde agarose gels, stained witb 
eihidium bromide, and transferred to a nylon membrane. The membrane was 
probed with a ^P-labcted lactoferrin cDNA derived fioro human uterus (nu- 
cleotides 718-1654; accession no. S52659) using PCR techniques, followed 
by cloning into pGEM-42 (Promega, Madison, WI). Jn order to insure that the 
quality and quantity of RNA analyzed by Northern blotting was equivalent 
between control and fteated groups, the blot was probed simultaneously for 
glyceraldehyde-3-phosphate dehydrogenase. 

." Statistical Analysis. Values are presented as means ± SD. Differences 
between the zona basalis and functionalis were tested by the two-taDed 
Student's / test. • 

RESULTS 

lmmunohistochemical Analysis of Lactoferrin Protein 

Normal Cycling Endometrium. ImiBUDohistocherDjca! studies of 
normal cycling human endometrium localize lactoferrin protein pre- 
dominantly to ibe glandular epithelium deep in tht zona basalis and 
not to the functionalis (Fig. 1A). The association of lactoferrin protein 
expression with the zona basalis is statistically significant. (P < 0-001; 
Fig. 2). Two to 56% of the glands express lactoferrin at any given time 
during the menstrual cycle. Within positive glands, lactoferrin protein 
immunolocaJization is heterogeneous in that positively staining epi- 
thelial cells are interspersed with cells negative for laaoferru) expres- 
sion. No apparent differences in morphology, PCNA, ER, or PR 
expression are seen to account for the heterogeneous pattern of intra^ 
and intei gland lactoferrin expression in normal endometrium. SirmlaJ 
to our previous findings in mouse uterine epithelial cells, the positive- 
staining glandular cells of the human endometrium demonstrate two 
distinct iromunostaining patterns for lactoferrirt, cytoplasmic and nu- 
clear (Fig. IB\ seen with both formalin and Bouin's fixation. Jit 
evaluating the temporal expression of lactoferrin, there is a trend 
towards more glands expressing lactoferrin during Ihc secretory 
phase. Because of the large variance, the trend is not statistically 
significant. As expected, the polymorphonuclear leucocytes in the 
endometrium also stain intensely for lactoferrin protein, which is 
stored in their secondary granules (Fig. 3> These results demonstrate, 
for the first lime, that lactoferrin protein is expressed in the human 
endometrium predominately by polymorphonuclear leukocytes and 
epithelial cells of glands located deep in the zona basalis. 

Proliferative Endometrial Disorders: Hyperplasias and Adeno- 
carcinomas. Imn-ttmohistocliemisiry reveals that the expression of 
larjofrrriji psr-trSn if wae^exi in 66.6% (S of 22) of me- endometrial 
adenocarcinomas examined. In one-half of these cases (4 of 12), 
Jacioferrin is intensely expressed by malignant epithelial cells 
rruougJtour the entire tumors (Fig. 4). The other four adenocarcinomas 
demonstrate increased staining for laciofeirin in concentrated regions 
of the !urno;s. In all tight cases where lactofenin expression is 
elevated, the cells expressing lactoferrin have one similarity with 
normal positive glands in that they demonstrate heterogeneous slain- 
irjg of interspersed positive and negative cells. However, the expres- 
sion oi iactofenin by the malignant cells clearly differs from normal 




F»£. ). Localiution of laciofcrrin protein in normal cycling endometrium, by immxt- 
" nohisiocberoisuy using a specific polyclonal antibody. Our analysis rtvepfc thai UcioS- 
crrio pTDlfbi is pjcscnl'in > limited number oJ glands located in the zona basalts ol the 
endometrium (AX * J n * Ako, note thai JBCiofcmD is beterofcncoos within positive 
{lands. It., cells staining foi lactofenin we inlcnpetscd with negative -si a ming epithelial 
cells throughout the "land. Two immunohiMocbcmicaJ staJninf psltems arc noted for 
laciofemn in normal uterine epithelium. In one pattern, lactoierrin protein is imm undo- 
es tired primarily over the cy top* asm (BJ\ and in tbc other, ibe stiimng ts seen over the 
rmcleni (B2); X 40. 



positive glands id that tbe Jactoferrin is not limited to the basal regions 
of the tumors, many more ceDs arc positive, and tbe relative iDlensity 
of the stainiDg over individuals cells is increased. Although inci eased 
lactofenin expression is associated with malignant transformation, we 
do not find a correlation between lactofenin protein presence and the 
stage, nuclear giade, Federation Jnternationalt des Gynecologistes el 
Obsietristts grade, or the depth of myometrial invasion in ihe 12 
tumors studied (Table 1). In sharp contrast to the common dys regu- 
lation of laclofeirin expression found in the malignanl endometrium, 
only J of 10 endometrial hyperplasia specimens evaluated contained 
an increased number of cells staining for lacloferrin. The hyperplastic 
specimen over expressing Jactoferrin was tead as complex without 
atypia. 

In Situ and Northern Analyses of Lacloferrin mRNA 
Expression in Normal and Malignant Endometrium 

To further our trndei sending of !hc location cf lactofenin p;oiei/> 
synthesis in the human endometrium, we examined lactofenin mRNA, 
expression by in situ and Northern hybridi2ation using specific 33 S- 
labeled probes foi human lactoferrin. No delectable RNA hybridiza- 
tion is observed in the norma) endomeuium by in sriu hybridization, 
even in the presence of iromunodeteciable protein, usins equivaJent 
hybridization conditions and deveJopmenl times a5 used /oj the ade- 
nocarcinomas- Con sir tent with the results obtained by in situ hybrid- 
ization, long exposure times were. required to demonstrate laciofenin 
mRNA in not ma I endometrium by Northern blot u>jng poiy(A^)- 
mRNA. This indicates that laciofcnin mRNA is present in normal 
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tissue but in very low levels, consistent with tbe very limited pattern 
of protein expression in normal endometrium. Equivalent RNA load- 
ing and quality for each specimen was demonstrated by etbidiuni 
" ' " " bromide staining of the RNA gels (data not shown) and by probing for 
the housekeeping gene glyccraldehyde-3-phosphate dehydrogenase 
that does not fluctuate significantly with the metabolic state of the 
tissue. A representative Northern blot is shown in fig. 5. In situ 
hybridization with several adenocarcinomas reveals that there is a 
direct correlation between the localization of lactoferfin mRNA and ; 
the irnmunostaining of expressed lactofenin protein (Fig. 4). Lactof- 
enin mRNA is not associated with polymorphonuclear leukocytes by 
* in situ hybridization in either normal or malignant tissue. • 

Western Biol Analysis 

To confirm the specificity of tbe antisera that we used for immu- 
nomstocnemistry, we perform Western Mot auja/ysrs on proteins 
extracted from both normal and malignant endometrium which were 
separated by 8.5% SDS-polyacrylamide gel electrophoresis (Fig, 6).. 
Immunoblotting identified a single broad protein band with a molec- 
ular weight between 70,000- 80,000 in both normal and neoplastic 
endometrial tissue bomogenates, consistent with tbe reported molec- 
ular weight of human lactoferrin. Supporting tbe imraunocytocbemi- 
cal analysis, a representative irnmunoblot • clearly demonstrates thai 
tbe proportion of protein thai is lactofenin is markedly increased in 
ibe adenocarcinomas in comparison to the normal endometrium. The 
molecular weight of lactofenin in the adenocarcinomas appears to 
have a slightly higher molecular weight that the predominarit form in 
normal tissue. 
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Fig. 2. Percentage of endometrial clami* cj. pi easing lacioierrin piolein by imoi "no- 
bis tocbe mis try, S if nifica nOy mote glindi arc positive in the region ©( ibe zona bawl* 1 
than in the 2 on* runciicmitis of tbe enrfomciihini (P < 0.001). Zona basal is. »; *° n * 
iuncijOi»l>». Kl Although iticie is a uend towards mofc ol ibe basalis glands expreWt 
, laciricnin in the seoetory phase (right » tpaip?red with th« Ic/t M) t ibis vai nD * 
jiaiinicstly 5ieniltC3Dt. I 
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Fig. 3. Polymorphonuclear lewkocyics (or;o>»-j) art scan tied throughout the endome- 
trium and siain initnseiy ioi lanofenin. Laclofe/rin is s known component ©/ the 
secondary granules in poljroorprtonuctcn leukocytes. Trie presence of a segmented 
nucleus and l»cioftmD proiein w in excellent method foi identify »r>£ this group of 
rDflprnmatory cells. A. X 20. 0, X 60. 



Correlation of Lacloferrin Expression with the Expression of 
PCNA, Ki-67, HER~2/neu f ER, and PR 

In an attempt to characterize the phenotype of endometrial cells 
which express lacloferrin, we performed irorouKohisrocheroistry on 
serial sea ions for lbe Ki-67 antigen, PCNA, HER-2/neu, lactofenin, 
ER, and PR. In normal tissue, Ki-67 and PCNA expression are cell 
cycle -specific markers of cell proliferation (J 6, 17). Upon analysis of 
normal cycling endometrium, do relationship between lactofenin pro- 
tein expression and ER, PR, or Ki-67 expression was observed. 
Similarly, in roost of the adenocarcinomas evaluated, no relationship 
was noted between lactofenin and PCNA protein expression. How- 
ler, in one adenocarcinoma (Fig. 7), there was a cJeai inverse 
lelaiionship seen between Jactofenin and PCNA localization, which 
was present throughout the entire tomor. Most dramatic, however, was 
* striking inverse con elation seen between Jacioferrin and PR expres- 
sion in 8 of 8 PR-positive uterine adenocarcinomas (Fig. 7). Two 
tumors negative for PR also did noi express Jacioferrin. Although an 
inverse corielaiion was also suggested w ith YYERUncu and PR, the 
iuvetse correlation was mote precise with Jacioferrin in these tumors. 

DISCUSSION 

In the mouse uierus. Jactoirrrin is an estrogen- induced uterine 
iecjciory protein thai is present duougjiom die epithelium (7), and it 



is expressed concomitantly with epithelial cell proliferation. In con- 
trast to bctoferrir/s ubiquitous expression in the estrogenized mouse 
uterine epimeJiumt, laclofenin protein is limited to glandular epithelial 
cells in the basal regions of Donnal bum an endometrium and usually 
to glands that were directly adjacent to the myometrium (ie,, the 
deepest glands of the zona basalis). This regional localization of 
lactofemn expression is not surprising in that other biochemical 
parameters have been reported to show site specificity in primate 
endometrium. These parameters include the proliferative index and 
the expression of the secretory component of IgA (18). Similar to our 
observations in the endometrium, lacloferrin is also expressed region- 
ally in the mammary gland. In bovine breast tissue, lactofemn is 
localized primarily to the basal alveolar cells (19); whereas in human 
breast tissue, the ductal epithelium appears to be the primary source of 
secreted laclofenin during Jactation (20). 

Examining the endometrium on different days of the menstrual 
cycle demonstrates a trend towards increased lactofemn expression 
during the luteal phase. Although this data is not statistically signif- 
icant, the cyclic variation may be biologically relevant Kim etoL (21) 
recently reported that the basal endometrial epithelial cells are unique 
because they proliferate during the postovulatory luteal phase. Inter- 
estingly, in the mouse uterus, there is a direct correlation between 
lactofenin expression and epithelial cell proliferation (6). Therefore, 
lacloferrin may have a similar role in the human and mouse endome- 
trium. If lacloferrin expression is cyclic, the ERE in the 5'-flanlcing 
promoter legion of the human lactofemn gene may be activated 
during the luteal phase (30, 11). 

Another similarity between human and mouse uterine Jacioferrin 
expression is the observation of two immunobisiochemical staining 
patterns. In one pattern, the anlisera binds primarily over the cyto- 
plasm, and in the other, the nucleus is the primary site of localization. 
Although this could represent a fixation artifact, we have now ob- 
served this pattern in two species and under differ tnl fixation condi- 
tions. It has been demonstrated that signaling peptides, i.e., platelet- 
derived growth factor (22), Inl-2 (23), and probasin (24), can be 
. selectively directed into the. nucleus, cytoplasm, or secreted. II is 
believed thai this differential processing may allow proteins to have 
intracrine, autocrine, and paracrine roles, depending on the physio- 
logical state of the cell. The two locaTjzation patterns observed suggest 
that lacloferrin might have signaling sequences that direct the final 
destination of the mature peptide. 

In normal endometrium, lacloferrin roRNA is present but in very 
low levels. Prolonged exposure times are needed to visualize lbe 
mRNA band with Northern analysis. Although with in situ hybrid- 
ization the Jacioferrin mRNA signal is easily seen in adenocaicino- 
mas, we failed lo localize the laclofenin mRNA in normal endome- 
trium using an equivalent exposure time. These low levels of mRNA 
in noimal endometrium suggest that the synthesis and degradation of 
lactofenin mRNA is moie tightly regulated in normal tissue than in 
the adenocarcinomas. Because of the low levels of message in normal 
tissue, we are unable at this lime to definitively conclude thai laclo- 
fenin mRNA is synthesized by the same epithelial cells which express 
the pjotein. 

In endometrial adenocarcinomas, malignant transformation of the 
endometrium is associated with the up-regulation of laclofenin 
mRNA and protein biosynthesis. The up-regulation al the RNA level 
is deroonstraied by an increase in steady-stale RNA levels using boih 
in siru hybridization and Noithern analysis techniques. In these can- 
cers, we also observe an increase in the number of lactoferiin-posiiive 
cells, which express both ihe protein and mRNA. In this study, 8 of 
the 32 adenocarcinomas evaluated overexpress laciofeirin, compared 
with only 1 of 10 hyperplastic specimens. The form of Jacioferrin 
pjotein extracted from endometrial adenocarcinomas appears to have 
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Fig. 4. Coloealiiatwn of lactoferrin protein pontU) and 
mRNA (n^Af paneb) in a normal proliferative endometrium tnd 
endometrial adenocarcinomas by performing imcmftohhtochcjnis- 
try and in iiui RNA hybridization on serial sections. Dual analysis 
of protein and mRNA expression reveals thai glands in norma) 
endometrium do Dot have detectable mRNA, as measw td by in lira 
hybridization (top panrtr t X 40), whereas analysb of the adeno- 
carcinomas dearly demonstrates a direa correlation bcrwtcn pro- 
tein &nd RNA expression fox Janofcrrin (middle and bottom pontfr, 
X 10). Note that lactoferrin protein and mRNA is distributed io a 
heterogeneous paiiera in the epithelial cells of tot adenocarcinomas. 
As is shown in Pig. 1, » betcrogeneops staining pattern for lactof- 
crrin protein is also seen frequently fn norma] endometrium. 
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a slightly higher molecular weight than ihe prorein present in normal 
tissue. This could be due lo alteraiions in the processing of the 
lactoferrin mRNA, protein, or gJycosyJatjoi) by the malignant cells. 
Alternatively, there could be minor differences between lactoferrin 
protein which is present in neutrophils and the form synthesized by 
uterine epithelial cells. suggest two hypotheses to explain lactof- 
errin ove i expression in endometrial adenocarcinomas. In the first 
hypothesis, laaofenin biosynthesis is deregulated by the same pro- 
cesses that lead to the malignant transformation of endometrium. If 
Ihis hypothesis, is Hue then lactoferrin may be a useful marker for 
endometrial adenocarcinoma investigation, and further research is 
needed to determine whether lactoferrin plays a contributing role in 
tbe malignant transformation. A second hypothesis is thai Iaaoferrin- 
positive human endometrial adenocarcinomas evolve from tbe clonal 
expansion of cells residing in tbe regenerative zone {zona basaJis) of 
normal endometrium. It is interesting to speculate that lactoferrin 
expression in endometrial cancer may be Jinked to estrogen action in 



some way, since proliferative disorders of human endometrium are 
linked to chronic estrogen exposure over several years and sequencing 
data suggests that the promoter for the human lactoferrin gene does 
contain an ERE. 

Although the function of lactoferrin is unknown, a variety of 
biological roles have beeD proposed for lactoferrin which could link 
this protein to a role in cancer, including the regulation of DNA 
synthesis (25-29), modulation of the immune response (25, 30, 33)> 
and iron transport (32). Some forms of lactoferrin are reported lo have 
RNasc activity (33, 34). Secreted RNases ate involved in develop- 
ment, reproductive function, neoplasia, angiogenesis, and iinmuije 
suppression. (35, 36) If angiogenesis and immunosuppression art 
components of lactoferrin RNase activity, these properties could pro- 
mote tumor growth. 

In the endometrial adenocarcinomas, we observed a heterogeneous 
expression pal tern for lactoferrin, PCNA, Hct r Vneu y ER, and PR- 
With regard to prognosis, patient survival is reportedly worse when 
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Fig. 5. Nonbtm analysis of hciofeiiio mRNA t*p>essioji confirms ihnl endometrial 
adenocarcinomas (Lanes 3 and sigtnScantly overotpfcss the 2 _5-ki)obas* transcript of 
human tactoferrin in comparison to norma] endometrium (L<m« 7 and 2). The Northern 
hybridization data supports tbe in situ RNA results and confirms thai Jftciofwrin RNA 
expression is dys/e^ulatcd in uterine adenocajcinomas. Normal uterine tissues appear to 
contain. Jow steady-sutc RNA levels of betofemn, reflecting a controlled pattern of 
protein expression. Equivalent RNA loading and quality «ch specimen was demon- 
strated by ethidium bromide staining of the RNA gels (data not shown) and by probing for 
B housekeeping gene (g)yceraldthyde-3-phosphale dehydrogenase) that dot* r>o» fluctuate 
significantly with tbe metabolic Slate of the tissue. 
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endometrial adenocarcinomas Jose sex steroid receptors (37, 38), have 
a higher proliferative index (39), and demonstrate DNA aneuploidy. 
During the tumor progression of endometrial adenocarcinomas, it 
appears that the loss of steroid hormone receptors occurs earlier than 
either the increase in proliferation rale or the development of DNA 
aneuploidy (40). In our study, we note a striking inverse correlation 
between the expression of lacloierrm and PR in the endometrial 
adenocarcinomas. An inverse relationship also has been described for 
HER-2//ww and PR in endometrial adenocarcinomas that correlates 
with patient prognosis. Furthermore, in cancers of the human endo- 
metrium, ovary and breast Ber-2/neu expression has been associated 
with advanced disease and poor survival (41-43). HER-2/neu is an 
oncogene lhat shares sequence homology with the epidermal growth 
factor receptor and is speculated to contribute. to aberrant growth. Of 
note is that lactofernn biosynthesis in the mouse uterus is associated 
with the expression of the epidermal growth factor. Like HER-2/neii, 
the epidermal growth factor receptor is also frequently overexpressed 
in PR-negative cells of endometrial adenocarcinomas (44). The am- 
plification of growth factor receptor expression in PR-negative endo- 
metrial adenocarcinomas may be associated with the acquisition of 
growth autonomy and hormone independence, which may contribute 
to the pooier prognosis of PR-negative endometrial carcinomas (45). 
Some endometrial adenocarcinomas, including recurrent tumors, can 
be treated successfully with progesterone therapy (46, 47). Although 
the significance of the inverse relationship between lacioferrin and PR 
expression is not known, we speculate that the PR-negative cells do 
not undergo the normal growth inhibition and secretory differentiation 
normally associated with progesterone action. ; 

A survey of human tissues reveals that lactofernn is expressed by 
most normal mammalian exocrine glands and may be a prognostic 
marker in tumors (20). Lacioferrin is found in normal ductule breast 
epithelium and in primary breast carcinomas. In breast tumors, there 
is an inverse conelalioD between lactoferrin and ERs (20). Notably, 
lactoferTin expression in breast cancer may fall into the same category 



A. Immunoblot 



B. Coomassie Blue 
Protein Staining 



Fig. 6. Western blot analysis using an antiserum 
speciBc lot human lactofenin was performed on 
proteins extracted hvm normal' and malign: ni en- 
dometrium and separated by SDS- poly aery larrudc 
;et electrophoresis. A broad protein band with a 
molecular >eight between 70,000 and 30,000 is 
delected in both normal and neoplastic endometrial 
tissue homogrnatei, consistent willj the reported 
mojtculai weight of human lacto/emn (A). Tbe 
roost significant observation from the immunoblDt- 
ling studies is that lactoferrin protein is markedly 
elevated in ibe accnocai cinemas, in comparison to 
norm*! t ndorocirium. which iuppom ibt irnrmmo- 
r> toe he mica) analyst* thst demonstrates t »r eater 
number of cells positive for lactofenin protein in 
the uicrine tumors. B. the relative zmotml of protein 
loaded in cich lane by Coomassie bhre staining. 



MW 

markers 



97.4 

Lactoferrin - 



CM 




t — 






to 


E 


• E 


o 


o 


c 


c 


*o 

w 




m 




o 


o 


o 


o 


c 


cr 


00 


CP 






< 


< 



> 
CO 

J? 

1 

rx 



o 
o 

CD 
CO 



68 




MW 

markers 



97.4 — 



OJ 

rtJ 

E 
o 
c 
o 

r5 
o 
o 

tz 

CO 
HO 

< 



E 
o 
cz 

o 

k_ 

ro 
O 

o 
cz 
a> 

X5 

< 



CD 
> 

o 



O 

o 

CD 

in 



68 




1H3 



LACTOFERntN DYSREGULATlOhi & ENDOMETRIAL CMXTR 



F>g. 7. ConeJaitro of laoofmin protein exprts- 
shm with the PCNA, a marker of pr oldest ion, and 
PR as measured by immunohistocheniistry pcr- 
iorroed on serial sections; X 20. In most cases* no 
relationship between bcrolcnin and PCNA expres- 
sion is found in either normal or malignant endo- 
roeimtm (d»la not shewn). However, in 1 of 12 
Bdenocarcinomas, an inverse correbbon is seen 
btrwten bctofemn (A) and PCNA (B) localixa- 
lion. This striking pattern wis consistent through- 
out the entire tumor, suggesting 0* possibilily of 
crll cycle regulation of l*ciofcfrrr expression in 
this adenocaidnoroa. The boacm ponth exhibit aa 
inverse correlation between bctofcrrin (C) and PR 
expression [D\ which was seta in all eight endo- 
metrial adenocarcinomas which cxprf««* PR- 
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as the other maimers for ER-Dcgativc rumors, such as amplification of 
EGF leceptor, HER-2/neiz, and transforming growth factor a expres- 
SiOD, which arc associated with poor prognosis. Id gastric carcinomas, 
■ bctoferrin expression is associated with transformation of specific 
cell types including intestinal -type carcinomas, adenomas, and incom- 
plete intestinal metaplasias (28). Although this is complete specula- 
tion at this time, perhaps lactoferrin over expression in the various 
malignancies may complement the actions of the giowth factor path- 
way molecules and contribute to the autonomous growth of these 
himois. 

In conclusion, om studies reveal that lactoferrin is. associated 
with a unique population of epithelial cells in ibe zona basaJis and 
that lactoferrin ovei expression may be associated with roalignanl 
tiansfoiroation of the human endometrium. Further studies are 
Deeded to elucidate the role of lactoferrin id normal and patholog- 
ical endometrial physiology. 
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Cell proliferation in human soft tissue tumors correlates with platelet- 
derived growth factor B chain expression: an immunohistochemical and in 
situ hybridization study. 

Wang J. Coltrera MP . Gown AM . 

Department of Pathology, University of Washington, Seattle 98 195. 

The authors tested the hypothesis that the B chain of the platelet-derived growth factor 
(PDGF), a known connective tissue mitogen and growth factor, could be expressed by 
human soft tissue tumors, and that its expression could play a role in the control of cell 
proliferation in these tumors. Using a set of 56 soft tissue tumors, including benign 
tumors and all three grades of sarcomas, PDGF-B chain protein was localized using 
unmunohistochemistry and PDGF-B mRNA was localized using in situ hybridization 
The hypothesis that PDGF-B expression was related to cell proliferation was tested by 
simultaneously demonstrating the expression of the proliferating cell nuclear antigen in 
sequential tissue sections of the same tumors. Sixty and 82% of tumors had demonstrable 
PDGF-B mRNA and protein, respectively, with a strong correlation between their 
degrees of expression (P = 0.0001 ). Among the sarcomas, a strong correlation between 
PDGF-B expression and increasing malignant tumor grade (P = 0.006), and between 
PDGF-B expression and increasing proliferating cell nuclear antigen index (P = 0.01) 
was found. All tumors were also demonstrated to express the beta receptor of PDGF via 
unmunohistochemistry: These studies suggest that PDGF-B expression may be an 
important mediator of cell proliferation control, via an autocrine mechanism, in human 
soft tissue tumors and may correlate with clinical outcome in the sarcomas. 

PMiD: 790391 1 [PubMed - indexed for MEDLINE] 
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Expression of cadnerins and catenins in paired tumor and non neoplastic 
primary prostate cultures and corresponding prostatectomy specimens. 

Wan&J, Krill D. Torbenson M. Wang O: Bisceglia M Srnnorjr Thnn... A . 
DeFlavia P. Dhir R. Becich MJ . ' ' 

Department of Pathology, University of Pittsburgh Medical Center, PA, USA. 

Cadherins are a family of transmembrane proteins that play a crucial role in cell 
differentiation, cell migration, and intercellular adhesion. Cadherins are associated with 
catenins through their highly conserved cytoplasmic domain. Down-regulation of E- 
cadhenn protein has been shown in various human cancers. This study examined the 
expression of cadherins and associated catenins at the mRNA level. Paired rumor and 
nonneoplastic primary prostate cultures were obtained from surgical specimens 
Quantitative multiplex fluorescence reverse transcriptase-polymerase chain reaction 
(QMF RT-PCR) and quantitative analysis were performed and correlated with 
lmmunostain results. Six of seven cases of neoplastic cultures showed moderately-to- 
markedly decreased levels of E-cadherin and P-cadherin mRNA. Similar losses of alpha- 
catenin and beta-catemn mRNA were alsb observed. The results of QMF RT-PCR 
showed good correlation with the results of immunohistochemical studies based on 
corresponding formalin-fixed sections. In conclusion, this paper presents a coordinated 
down-regulation in the expression of E-cadherin and associated catenins at the mRNA 
and protein level in most of the cases studied. This down-regulation may play an 
important role in the pathogenesis of prostate cancer. 

PMID: 1 1 127708 [PubMed - indexed for MEDLINEJ 
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Abstract Cadhenns arc a family of transmembrane 
proteins that play a crucial roJe in cell differentiation 
cell migration, and intercellular adhesion. Cadherins are 
associated with catenins through their highly conserved 
cytoplasmic domain. Down-regulation of E-cadherin 
protein has been shown in various human cancers. This 
study examined the expression of cadherins and associ- 
ated catenins at the mRNA level. Paired tumor and non- 
neoplastic primary prostate cultures were obtained from 
surgical specimens. Quantitative multiplex fluorescence 
reverse transcriptase-polymerase chain reaction (QMF 
RT-PCR) and quantitative analysis were performed and 
correlated with imrounostain results. Six of seven cases 
of neoplastic cultures showed moderalely-to-markedly 
decreased levels of E-cadherin and P-cadherin mRNA. 
Similar losses of a-catenin and 0-catenin mRNA were 
also observed. The results of QMF RT-PCR showed 
good correlation with the results of immunohistochem- 
ical studies based on corresponding formalin-fixed sec- 
tions. In conclusion, this paper presents a coordinated 
down-regulation in the expression of E-cadherin and 
associated catenins at the mRNA and protein level in 
most of the cases studied. This down-regulation may 
play an important role in the pathogenesis of prostate 
cancer. 
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Prostate cancer is the most common malignant tumor 
and the second leading, cause of cancer death in men 
The clinical course of patients with prostate cancer 
varies widely, and different factors contribute to this 
marked clinical variability, including genetic back- 
ground, Jiormonal environment, and the invasive 
potential of the tumor. Invasion and metastasis are the 
hallmarks of malignancy and have been closely linked to 
alterations in cell-to-cell adhesion, cell migration, and 
interactions with extracellular matrix components [22J. 

. Cadherins are a family of transmembrane glycopro- 
teins responsible for maintaining the integrity of tissue 
and are involved in cell differentiation, cell migration, 
and intercellular adhesion through a calcium-dependent 
mechanism characterized by homotypic adhesion [35- 
37]. Their highly conserved cytoplasmic domains asso- 
ciate with catenins, a group of intracellular proteins that 
mediate contact between the cadherins and the micro- 
filaments of the cytoskeleton. Each cadherin subclass 
shows a unique tissue distribution: E-cadherin is pre- 
dominantly expressed in epithelial cells and P-cadherin is 
restricted to decidua tissue and the basal or lower layers 
of stratified epithelium |30]. 

The accumulating evidence suggests a decrease or loss 
of function in E-cadherin and P-cadherin in several 
human carcinomas (3, 9, 23, 32]. Loss of heterozygosity 
(LOH) at chromosome 16 in the location of the E-cad- 
herin gene is present in a high percentage of prostate 
ca ncers |6, 2 1 , 26, 28J. Decreased expression of E-cadherin 
is seen in various human malignant tumor cell lines, and 
the level of decrease correlates with the invasive potential 
of the tumor cell lines (1, 9, 12, 24, 40, 42]. In addition, 
many, but not all, immunohistochemical studies using 
formalin-fixed, paraffin-embedded tissue have shown that 
the E-cadherin protein is decreased in prostate cancer and 
the decrease is correlated with tumor grade |4, 8, 13, 27, 
38]. Other studies have shown decreased-to-absent 
P-cadherin levels, but variable E-cadherin levels [33], 
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a- and /J-catenins bind the cytoplasmic domain of 
E-cadberin and link it to the cytoskelelon [16, 3) J. Down- 
regulation of expression and deletion or a-catenin genes 
were identified in several human cancer cell lines [19, 24, 
42], Immunohistochemical studies showed decreased 
ctiatenin staining, which correlated well with the loss of 
E-cadherin staining and patient survival [25, 29, 39]. In 
addition, decreased ^-catenin protein expression appears 
to be associated with malignant transformation of epi- 
thelial tissue [34]. These results suggest that cadherins 
and catenins may function as tumor invasion-suppressor 
genes. However, most of these results were obtained m 
studies using transformed tumor cell lines and formalin- 
fixed, paraffin-embedded tissue. To our knowledge, 
no studies to date have examined the co-expression of 
cadherins and catenins at the mRNA level using material 
derived from surgical specimens. 

Recently, we have developed the methodology to 
cultivate primary epithelial cells under defined condi- 
tions from surgical prostatectomy specimens [20], Areas 
of both carcinoma and noh-neoplastic tissue are identi- 
fied grossly, verified histologically, and then cultured 
separately, resulting in paired primary cultures of both 
non-neoplastic and neoplastic epithelium from the same 
patient. The non-neoplastic tissue cultures serve as an 
important control of any person-to-person variability in 
the expression of the genes of interest. 

In this study, we investigated the co-expression 
of cadherin and catenin mRNA from multiple paired 
primary prostate cultures derived from surgical prosta- 
tectomy specimens using quantitative multiplex fluores- 
cence reverse transcriptase-polymerase chain reaction 
(QMF RT-PCR) [41, 44]. Immun ©histochemistry stud- 
ies of cadherins and catenins were performed on the 
corresponding paraffin-embedded prostate tissue. 



Materials and methods 

Tissue specimen and primary prostate cultures 

We studied patients with adenocarcinoma of the prostate who 
underwent prostatectomy at the University of Pittsburgh Medical 
Center during the lime interval January 1996-January 1998. No 
patient had received ireaLmcnt prior to surgery. Fresh prostatec- 
tomy specimens were sectioned and grossly examined, representa- 
tive samples of neoplastic and non-neoplastic tissue were collected, 
and the diagnoses were confirmed by examination of hematoxylin 
and eosin (H&E) -stained sections. Epithelial cells from tumor and 



non-neoplastic areas of the specimen «««<>J«ed ! ™ d f th £- 5^ 
tured with a chemically defined medimn (CDM) without addition 
of growth factors as previously described. Tissue fragments ^wcre 
cut into small pieces and underwent a series of co agenous diges- 
tions. Following each digestion, the cells were pelleted by centri- 
bgation-at 1,000 rpm for 4 M1N \20). To selectively promote 
epithelial cell growth, the pellets were resuspended and maintained 
in serum-free CDM supplemented with epithelial I growth factor 
(EGF) [201. The culture's morphology was examined dairy, and 
epithelial cells were allowed lo grpw until confluence was reached, 
between days 7 and 10. 

Isolation of total RNA and synthesis of cDNA 

Total RNA was extracted from the cultured primary prostate 
epithelial cells at first passage according to the Trizol solution 
(Gibco BRL, Rockville, MD) modified method of Chomczynsxi 
and Sacchi 17). The RNA was then quantified spectrophotomet- 
rically. Two micrograms of total RNA were used for first strand 
cDNA synthesis using oligo-dT primers and MMLV reverse tran- 
scriptase (Gibco BRL, Rockville, MD). 

PCR primers and quantitative multiplex fluorescence PCR 

PCR primers for hepatocyte growth factor (HGF) and c-myc 
were synthesized as previously described |I7}. PCR primers for 
E-cadherin, P-cadherin, a-catenin, 0-catenin, and jjJ-actin cDNA 
were designed according to cDNA sequences provided by the 
GeneBank (WWW2.ncbi.nlm.gov/geneb3nk/query). The P™*;" 
were selected to amplify 150-250 bp target genes and the PCR 
products from each target gene were designed to have a different 
size nable 1). The reverse primers were synthesized with fluores- 
cein molecules covalently attached to the 5' end (B^U Rockvdle. 
MD). Twenty-five microliters of PCR reactions for QMJ*-PL.K 
contained primers (20 uM each), cDNA corresponding lo 50 ng of 
total RNA, dNTPs, and reaction buffer. The reactions were am- 
plified for 21 cycles at 94 °C for I min, 57 »C for 2 mm; and 72 C 
for 2 min Five microliters of QMF-PCR reactions were mixed w,ib 
an equal volume of sequencing gel loading buffer, denatured, and 
aliquols were electrophoresed on an AB1 373A automated sequencer 
(Applied Biosyslcms, Foster City. CA) using a matrix specific for 
fluorescein (Fig. 1A). . 

Automated sequencer gels were run for 6 h at 30 W using 
Genescan software (ABJ, Foster City, CA). Ume assignments and 
areas of the peaks corresponding to fluorescent peaks were assigned 
and quantitated by the Genescan software using Photorouilipber 
tube (PMT) voltages (Fig. IB). AH experiments were done m 
triplicate and the results presented as means and standard devia- 
tions (SD). 

Antibodies and immunohislochemislry 

Five-micron sections were obtained from fonnalin-fixed, paraffin- 
embedded tissue blocks. They were deparaffimzed and to«»i<* 
with graduated clhanols. Slides were then microwaved in 1.1 M 



T»Me 1 Summary of PCR 
primer sequences and PCR 
products 



Primer 


Sequence 


Target gene 


Size of PCR product (bp) 


I 


Forward cccacaclgtgcccatctacg 


p*- Act in 


170 


2 


Reverse gettctccitaaigtcacgc 


E-cadherin 


190 


3 


Forward caaagtgggcacagatggtgig 


4 


Reverse cigctlggallccagaaacgg 


P-cadherin 


220 


5 


Forward gcaagagccagctcigtttagc 


6 


Reverse acttgagclgattcagtctclgg 


a-Catenin 


182 


7 


Forward gatggacaaclalgagccagg 


8 


Reverse lataccaggcgggaagcatcg 


0-Catenin 


218 


9 


Forward tlclggtgccaclaccacagc 


10 


Reverse tgcatgccctcalctaatgtc 
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Fig. I A Genescan image of 
QMFRT-PCR of five paired 
non- neoplastic and neoplastic 
primary prostate cultures. 
cDNA corresponding to 50 ng 
of total RNA was subjected to 
21 cycles of PCR, and all 
reactions were done in tripli- 
cate. RNA of both E-cadherin 
and P-cadherin showed moder- ' 
ate- to-rnar ked reduction in pri- 
mary neoplastic cultures in four 

. of five cases (Cases I, 2, 4, and 
5). B Representative automated 
sequencer traces (ABI 373A) 
Trom 21 cycles of QMF 
RT-PCR using cDNA from a 
paired non-neoplastic (top 
panel) and neoplastic {bottom 
panel) primary prostate cultures 
(Case 2), and copy number of 
PCR products (st-actin, 
E-cadherin, and P-cadherin) 

, was calculated using peak areas. 
Neoplastic culture showed 
markedly decreased expression 
of both E-cadherin and 
P-cadherin 
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curate buffer (pH 6.0) for antigen retrieval. The avidin-biotin 

V^?,,T elh0d /°'^ nn ^ SI ^ n, * l 'y w » »^"d from 
Vector Laboratories (Burhngamc. Calif., USA). The E-cadherin 

H^nfi , n ^ tCnin> 2nd ^ alenin antibodies, all mouse mono-' 
Evaluation of immunosiaining 

Ti C f T "P la « c °f Positive cells, intensity of the staining, and cel- 
lular loeahzalion of the staining were examined by two independent 
observers using normal prostate epithelium from the same speci- 
men as an internal control. The intensity of the signal-was graded 
as strong, moderate, weak, and negative. The staining pattern of 
the tumor was compared with that of normal epithelium from the 
same specimens. 



Result* 

Expression of E-cadherin and P-cadherin 

Since many prostate cancers are known lo be histolog- 
ically heterogeneous, adjacent H&E-stained sections 
ol the hssue fragments sent lo culture were reviewed 



lo assess tissue homogeneity and to rule out the presence 
of other diseases. The paired primary cultures for this 
study were selected based on the following histologic 
features: (1) non neoplastic tissue showed no cancerous 
foci or high-grade prostatic intraepithelial neoplasia 
(PIN); (2) neoplastic sections contained less than 5% 
non-cancerous epithelium. Seven out of 38 pairs of 
cultures met the selection criteria and were included in 
this study. Histologically, all seven cases were moder- 
ately differentia led adenocarcinomas, with Gleason 
scores ranging from 5 to 7 (median = 6). In six of these 
cases, adjacent sections of tumor contained no benign 
prostate epithelium, and in one case (specimen 4), there 
was less than 5% non-cancerous epithelium. Prostate 
stromal cells express HGF, whereas the epithelial cells 
express c-met, the receptor for HGF [17, 20]. The cases 
included in this study showed no detectable HGF 
expression after 21 cycles of QMF RT-PCR (data not 
shown). This indicates there was no significant stromal 
cell contamination in the current epithelial cultures. 

QMF-PCR is an accurate method or measuring the 
relative levels of mRNA in small tissue samples [41). In 



this study, we used this method to quantitate the mRNA 
levels of E-cadherin and P-cadherin, relative to 0-actin. 
All the RNA samples contained no DNA contamina- 
tion, as shown by the absence of automated signals when 
RT was omitted from the RT-PCR reactions. We ob- 
served the expected linear increases in 0-actin, E-cadh- 
erin, and P-cadherin signal intensities between cycle 
numbers 18 and 24, with a cDNA input corresponding 
to 50 ng of total RNA. The ratios of 0-actin to 
E-cadherin and P-cadherin were constant, as was the 
E-cadherin/P-cadherin ratio. These results indicated that 
the target genes were consistently amplified in the reac- 
tions. Twenty-one cycles of PCR were chosen for all 
subsequent experiments. In addition, the raw sequencer 
quantitation of peak areas for /J-actin using the same 
cDNA input was similar among the paired primary 
prostate cultures (data not shown). 

When compared with the paired non-neoplastic pri- 
mary cultures and normalized with the /J-actin internal 
controls, four of the neoplastic cultures showed marked 
(>85%) reduction of E-cadherin mRNA levels, with 
one case showing no detectable E-cadherin mRNA. The 
other three neoplastic cultures showed mild-to-moderate 
reductions (Table 2). 

Six of seven neoplastic cultures showed moderately 
lo markedly decreased P-cadherin mRNA levels when 
compared with non-neoplastic cultures. Interestingly, 



the four cases showing marked reduction in levels of 
E-cadherin mRNA, and also demonstrated significant 
losses of P-cadherin mRNA. The case with no detectable 
E-cadherin mRNA also demonstrated near total loss 
of P-cadherin mRNA (Case 1). Another case (Case 3), 
with only rmld reduction in E-cadherin mRNA, showed 
no significant change in P-cadherin mRNA. Genetic 
variations among the patients were evident in the 
marked differences in the baseline levels of E-cadherin 
and P-cadherin expression seen in the rion-neoplastic 
primary prostate cultures. 



Expression of cc-catenin and #-caleniri 

The highly conserved intracytoplasmic domain of 
the cadherins interacts with a- and /^catenins, with the 
catenins serving as a link between the cadherins and the 
microfilaments of the cytoskeleton. Six cases of neo- 
plastic cultures showed mild-to-moderate reductions in 
a-catenin mRNA levels, ranging from 26 to 62%. These 
cases also demonstrated more severe reductions of 
0-catenin mRNA levels and generally correlated with 
the changes of cadherins in the same specimen (Table 3). 
Interestingly, the case with no significant change of 
P-cadherin mRNA and only a mild loss of E-cadherin 
mRNA also showed no change in mRNA levels of both 



Table 2 E-cadherin and P-cadherin expression in paired non-neoplastic and neoplastic primary prosta.e cultures derived from prosla- 


tcctomy specimens 












Case 


E-cadherin* 




Percentage 
loss in lumor b 


P-cadherin* 




Percentage 
loss in tumor* 




Non-neoplastic 


Tumor 


Non-neoplastic 


Tumor 




1 

2 
3 
4 
5 
6 
7 


71.0 ± 2.2 

27.4 ± 6.4 
18.0 ± 0.2 

93.5 ± 12.0 
41.5 ± 3.0 

56.7 ± 1.1 

53.8 ± 6.1 


Nty 

3.7 ± 0.3 
12.6 ± 1.9 

5.4 ± 0.9 
24.4 i 4.0 

8.1 ± 1.0 
777 ± 0.7 


100 
86 
30 
94 
41 
86 
59 


65.7 ± 2.0 

53.4 ± 6.5 

88.8 ± 6.5 
87.8 ± 7.0 

30.5 ± 1.4 . 
71.8 ± 2.9 
62.7 ±3.5 


0.8 ± 0.1 
17.8 ± 1.3 

86.4 db 9.5 

25.5 ± 2.5 
18.1 ± 2.4 
17.1 ± 2.2 
19.4 ± 22 


99 
67 
3 
71 
41 
76 
69 


■ Level of E-cadherin and P-cadherin are normalised with /J-actin from the 
b Percentage loss in tumor primary culture: [{N - T)/N) x 100% 
C ND. not delectable 


same sample: (E-cadher 


in or P<adberin)/aclin x 100 


Table 3 ct-Calenin and 0-calenin expression in 
tectomy specimens . 


paired non-neoplastic and neoplastic primary prostate cultures derived from prosta- 


Case 


a-catenin* 




Percentage loss 


0-catcnin* 




Percentage 
loss in tumor 




Non-neoplastic 


Tumor 


in lumor b . 


Non-neoplastic 


Tumor 




1 

2 
3 
4 
5 
6 
7 


45.1 ± 7.1 

49.2 i 1.6 

48.0 ± 5.9 

61.1 i 2.5 

58.2 ± 18 
46.7 i 4.7 
43.2 ± 2.6 


17.3 i 0.5 
36.2 i 4.0 
47.8 ± 9.0 

• 40.8 ± 2.6 
27.8 ± 1.6 
33.6 ± 4.7 

26.4 dt 1.7 


62 
26 
< 1 
33 
52 
28 
39 


19.7 ± 3.6 

21.7 ± 1.5 
40.1 ± 5.3 

36.8 ± 18 
29.8 ± 2.0 
31.5 ± 4.2 
22.3 ± 1.5 


ND C 
3.9 ± 0.2 
38 9 ± 6.2 

22.1 ± 1.9 

15.2 ± I.I 

13.5 ± 16 

1 1.6 ± 1.4 


100 
82 
3 
40 
49 
57 
48 



* Level of a-catenin and /J-catenin are normalized with £-actin from the same sample: (calenin/aclin) X 100 
b Percentage loss in tumor primary culture: 1(N - T)/N] x 100% 
C ND, nol delectable 
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a- and /?-calenins. Moderate interspecimen variation was 
observed in the baseline expression of a-catenin and 
0-calenin mRNA levels in the non-neoplastic cultures. 

Immunohistochemical studies of cadherins and catenins 

In benign prostate tissue, E-cadherin was, in all cases, 
uniformly localized to the membranes of luminal glan* 
dular epithelial cells, predominantly at cell-cell junctions 
(Fig. 3A). One case of prostate cancer showed complete 
negative staining for E-cadherin (Fig. 3B), and the re- 
maining six cases demonstrated reduced immunoslaining 
for E-cadherin, with 25-75% of cancer cells positive 
(Fig. 3C). The cancerous glands generally showed re- 
duced signal intensity and an altered heterogeneous 
staining pattern, which included focal cytoplasmic stain- 
mg and reduced membranous staining (Fig. 3C, Table 4). 

Benign prostate tissue showed uniform basal cell 
immunoreactivity for P-cadherin, with principally cyto- 
plasmic and focal membranous pattern staining. The 

Table 4 Immunohistochemical expression of cadherins and catc- 
cu?m , re Pr05tatCCt0my SpCcimcM cor '"P<>nding to primary prostate 



benign, glandular non-basal epithelial cells and stromal 
cells were negative for P-cadherin (Fig. 3D)i P-cadherin 
immunoreactivity was completely absent in two cases of 
prostate cancer (Fig. 3E), and the remaining five cases 
showed variable focal positivity, which was predomi- 
nantly cytoplasmic (Fig. 3F). This focal P-cadherin 
immunoslaining positivity was confirmed by staining 
multiple sections and by using different monoclonal 
antibodies; In some cases, serial sections also appeared 
to show immunoslaining for both E- and P-cadherins 
with the same neoplastic cells. 

In all cases of benign prostate tissue, or- and 0-catenin 
protein expression showed strong homogeneous staining 
of the luminal glandular epithelium and the basal cells. 
In a paltern similar to that of normal Ercadhcrin, the a- 
and 0-catenins were localized predominately at luminal 
epithelial cell-cell borders (Fig. 2A, Q. In all cases of 
prostate cancer, there was a miJd-lo-moderate reduction 
in staining for both catenins with 50-75% of cells 
positive, and ihe staining tended to be heterogeneous 
(Fig. 2B, D; Table 4). . 



Discussion 



Case 



E-cadherin P-cadherin a-catenin jff-caiemn 



1 






+ + 


+ + 


2 




+ 


+ + 


+ + 


3 


+ + + 


+ + 


+ + + 


+ + + 


4 


-+ 




+ + 


+ + + 


5 


•f + -f 


+ 


+ + + 


+ -f + 


6 


+ + 


+ 


-f -f + 


+ + + 


7 


4 + 


+ 


+ + 


+ +'+■ 



, • .- 6II .. ^tjY* positive; -t + , 25-51 
50-75% positive; + + + +, >75% positive 



In this study, we observed a coordinated down-regula- 
tion of the expression in the genes involved in ihe 
cadherm and catenin mediated cell-cell pathway at the 
mRNA level. The protein levels, as demonstrated by 
the immunohistochemical studies on the corresponding 
tissue sections, were generally consistent with Ihe 
mRNA data as well as with that reported in the litera- 
ture [18, 33, 34, 38, 39], 

E-cadherin showed the most consistent loss of ex- 
pression at both the mRNA level and the protein level. 



Fig. 2A-D Immunohistochemi- 
cal staining of a- and /f-calenin 
in non-neoplastic prostate and 
•n prostate adenocarcinoma. 
Original magnification xl)5. 
A, C Normal membranous 
expression of a-calenin and 
l-caienin in non-neoplastic 
prostatic epithelium (Case I). 
B Decreased immunoslaining 
for a-catenin in prostate . 
adenocarcinoma (Case I). 
D Decreased and heterogenous 
immunoslaining for ^-catenin 
in prostate adenocarcinoma 
(Case 1) 
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Fig. 3 A lmmunohistochemica) 
staining of E- and P-cadbcrin in 
nonneoplastic prostate and in 
prostate adenocarcinoma. 
Original magnification xl I5i 
A Normal membranous ex- 
pression of E-cadherin in non- . 
neoplastic prostatic epithelium 
(Case I). B, C Complete 
negative (Case 1) and focal 
heterogeneous (Case 3) immu- 
nosiaihing for E-cadberin in 
prostate adenocarcinoma. 
D Normal continuous basal 
layer expression of P-cadherin 
in a portion of nonneoplastic 
prostatic epithelium (Case I). 
E, F Complete negative 
(Case I) and mild focal hetero- 
geneous (Case 3) immunostain- 
ing for P-cadherin in prostate 
adenocarcinoma 




LOH at chromosome I6q, where both E-cadherin and 
P-cadhcrin are located, occurs in up to 30% of prostate 
cancers. Four of our cases showed a greater than 50% 
reduction in mRNA, ranging from 87 to 100%, sug- 
gesting that mechanisms in addition to LOH may play a 
role in the reduction of E-cadherin mRNA levels below 
that oif the 50% predicted by LOH alone. 

Previous studies have suggested that P-cadherin 
could serve as a specific marker, for basal cell differen- 
tiation and was not expressed in prostate cancer, 
although a recent study has shown focal P-cadherin 
expression in some prostate tumors [18]. In this study, 
we demonstrated P-cadherin mRNA in six of seven 
neoplastic cultures, although it was significantly reduced 
in at) six cases. It is, impossible to rule out the possibility 
that some of the P-cadherin mRNA expression may 
have resulted from potential contamination by small 
numbers of non-neoplastic basal cells admixed with the 
neoplastic cells within the culture material. However, we 
favor the interpretation that low levels of P-cadherins 
are expressed in cultured tumor cells, as well as weakly 
in some tumors in vivo owing to disruption of normal 
gene regulation. This interpretation is favored by the 
presence of focal immunostaining for P-cadherin protein 
in histologically neoplastic cells in five of seven cases. 
The histologic selection criteria (requiring minimal to no 
benign prostate gjands) should also have minimized 
major contamination. Despite the presence of mRNA in 
primary tumor cultures and focal positive immuno- 



staining, P-cadherin immunostaining may still serve as a 
useful basal cell marker because the staining pattern was 
distinctly abnormal in the cases where it was focally 
present- 
In this study, the levels of a- and 0-catenin mRNA 
were also lower than normal in six of seven cases, 
though the reductions were relatively less than those of 
cadherins, especially for the ot-calenins. Catenin protein 
expression was generally moderate to weak by immu- 
nohistochemistry and showed a heterogeneous cyto- 
plasmic and weak membranous staining pattern. The 
immunoposilivity was generally similar in most cases, 
with 25-75% cells showing positivily. The results of the 
immunohislochemical studies were generally consistent 
with the mRNA data; although not in all cases (e.g., 
Case 1); perhaps due to tumor heterogeneity. 

In this study, we also observed a coordinated down- 
regulation of E-cadherin and the catenins in most cases; 
this was most observable at the mRNA level. The cad- 
herins are tightly regulated during embryogenesis and 
appear to serve the need for precise spatio-temporal 
regulation. The promoter sequences of both E-cadhenn 
and P-cadherin have been cloned and functionally ana- 
lyzed |2, 5, 10, 15]. Both promoters have similar regu- 
latory elements, such as GC-rich regions and CCA AT 
boxes. Although the two promoters share similar se- 
quences, in vitro binding studies suggest that the two 
promoters are regulated by different transcriptional 
factors |ll]. The tissue specificity of these promoters 
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appears to be derived by different combinations of a 
relatively few factors common to many types of tissues 
and does not appear to be derived from transcription 
factors specific for each type of tissue. In our study the 

lx?» . lf f 0f M P ressi °n of «- and 0-catenins at the 
mRNA level was correlated with reductions in the levels 
or E-cadherin expression. Although the exact mecha- 
nism of this coordinated down-regulation is not known 
he coordinated pattern supports the hypothesis that 
loss or alteration of some regulatory feelers occurs 
during prostate tumor progression. Possible mechanisms 
include transcriptional factor alterations or hyper- 
methylahon of the promoter region |14. 43]. Limited 
information is available on the transcriptional regula- 
tion of the catenins, but it is possible that similar 
mechanisms may play a role. 

In summary, this is the first comparative study of the 

^TT! H f ?° neS inV0lvcd in a* cadherin-medi- 
ated cell-eelfadhesion pathway at the mRNA level using 
paired neoplastic and non-neoplastic primary cultures 
derived from prostatectomy specimens. Our results 
indicate that (I) there is a marked patient-to-patient 
vMiabon in the^ normal levels of the cadherins and 
ea emns; (2) mRNA levels of E-cadherin as well as 
catenins are s.gn.ficanlly reduced in sorhe prostate can- 
cer primary cultures, and the reduction tends to be to the 
same degree in each tumor, suggesting a defect in a 
regulating mechanism common to all of these genes; (3) 
P-cadhenn appears to be present at both the mRNA 
level and the protein level in some prostate cancers. This 
coordinated down-regulation of E-cadherin and catenin- 
mediated adhesion pathways may play a crucial role in 
tumor pathogenesis and metastasis. 
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Down-regulation of prostate-specific antigen expression by finasteride 
through inhibition of complex formation between androgen receptor and 
steroid receptor-binding consensus in the promoter of the PSA gene iu 
LNCaP cells. 

Wang LG , Lju_XM , KreisJVV, Budnian DR . 

Department of Medicine, New York University, Manhasset 11030, USA. 

As a specific competitive inhibitor of 5alpha-reductase, an intracellular enzyme that 
converts testosterone to dihydrotestosterone, finasteride is being extensively used for the 
treatment of benign prostatic hyperplasia and in experimental settings for prostate cancer. 
In this study, we showed that finasteride markedly inhibited prostate-specific antigen 
(PSA) secretion and expression. The promoter of the PSA gene contains several well- 
known cis-regulatory elements. Among them, steroid receptor-binding consensus (SRBC) 
has been identified as a functional androgen-responsive element. Our previous study 
showed that PSA was not only present in conditioned medium of the PSA-positive 
LNCaP cells but was also detectable in small amounts in PSA-negative cell lines, PC-3 
and DU-145 (L. G. Wang et al., Oncol Rep., 3: 91 1-917, 1996). A strong correlation ' 
between binding of nuclear factors to SRBC and the level of PSA present in the 
conditioned medium and cell extracts was found in these three cell lines, whereas no such 
correlation with binding was obtained using Spl oligonucleotide as a probe. Binding of 
LNCaP cell nuclear proteins to SRBC was diminished when the cells were exposed to 25 
microM finasteride, at which concentration 50% of both PSA iriRNA and protein were 
inhibited. As a major component of DNA-protein complexes, the level of androgen 
receptor was dramatically decreased in the cells treated with finasteride. Our data indicate 
that inhibition of complex formation between SRBC and nuclear proteins due to the 
remarkable decrease in the level of androgen receptor plays a key role in the down- 
regulation of PS A gene expression by finasteride in LNCaP cells. 



PMID: 9044850 [PubMed - indexed for MEDLINE] 
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Expressioa of calcycILn iu human melanocyte lesious. 
Weterman MA. van Muijen GN . Blocmcrs HP . Rutter DJ . 

Department of Biochemistry, University of Nijmegen, The Netherlands. 

When comparing two subsequent stages of melanocyte tumor progression we identified 
■ calcyclin as a new potential progression marker, the expression of which was correlated 
with metastatic behavior of various human melanoma cell lines in nude mice. In this 
study, we describe a good correlation between RNA and protein levels in the xenografts 
of these cell lines and extended these experiments to a panel of 120 routinely processed 
human melanocyte cutaneous lesions. Northern blot analysis demonstrated that calcyclin 
RNA expression was elevated in melanoma metastases as compared to several types of 
nevocellular nevi. Calcyclin staining using a specific polyclonal antiserum showed a 
more complex pattern. A stronger staining in a higher percentage of positive cells was 
observed in thick primary melanoma (> or = 1 .5 mm) as compared to thin primary 
melanoma (< 1 .5 mm). Calcyclin expression was also present in a higher percentage of 
cells showing a stronger staining in melanomas with higher Clark levels (> If) 
corresponding to the vertical growth phase of primary melanomas. Protein expression in 
nevocellular nevi was confined to the dermal part and was highest in the lower parts of 
the dermis. Remarkably, dysplastic nevi (atypical moles), potential precursors of 
melanoma, did not show any expression at all, either in junctional or dermal parts. 
Confinement of the expression to the dermal part of nondysplastic nevi and primary 
melanomas may reflect interactions with the microenvironment of the reticular dermis 
that occurs with vertical growth! 

PMID: 8261423 [PubMed - indexed for MEDLINE] 
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Estrogen regulation of the cytochrome P450 3 A subfamily in humans. 

Williams ET, Lcyk ML Wrightoa SA. Davies PJ . Loose PS , Shipley GL, Strobel HW . 

Department of Biochemistry, Medical School, University of Texas Health Science Center 
at Houston, 643 1 Fannin, MSB 6.200, Houston, TX 77030, USA. 

This study examines the possible role of estrogen in regulating the expression of the 
human CYP3A subfamily: CYP3A4, CYP3A5, CYP3A7, and CYP3A43. To accomplish 
this goal, mRNA was quantified from human livers and endometrial samples, and total 
CYP3A protein levels were evaluated by Western immunoblot analysis of the liver 
samples. The human endometrial samples were from premenopausal and postmenopausal 
women. The premenopausal endometrium was either in the proliferative or secretory 
: phase, whereas for the postmenopausal endometrium samples, the women had been 
treated with either a placebo or estropipate, an estrogen substitute. After analyses, 
CYP3A4 mRNA was shown to have lower hepatic expression in females thain in males. 
In the endometrium, CYP3A4 and CYP3 A43 are down-regulated by estrogen, whereas 
CYP3 A5 is expressed at higher levels during the secretory phase. CYP3A7 was not 
detected in the endometrium. In addition, the CYP3A subfamily showed increased 
mRNA expression in the liver as age increased. The expression levels of total CYP3 A 
protein and total CYP3A mRNA showed good correlation. Despite apparent regulation of 
CYP3A4 mRNA expression by; estrogen, the effects of estrogen may be overshadowed 
by additional regulators of gene expression. 
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Severely decreased MARCKS expression correlates with ras reversion but 
not witlr autogenic responsiveness. 

Wojtaszck PA, Stumpo DJ Blackshear PJ Macara lG . 

Department of Pathology, University of Vermont College of Medicine, Burlington 

Phorbol ester-inducible phosphorylation of MARCKS, the '80-kDa' substrate of protein 
kinase C, was undetectable in several phenotypically dominant, non-transformed 
revertants independently derived from the ras-transformed cell line NIH3T3 DT-ras 
Extremely low expression of MARCKS protein accounted for this apparent lack of ' 
phosphorylation. MARCKS-encoding mRNA levels were correspondingly decreased 
relative to normal and ras-transformed cells in all four ras revertant cell lines studied- C- 
1 1 and b-2, denved by 5-azacytidine treatment and selection with ouabain- CHP 9CJ 
denvedby ethylmethane sulfonate mutagenesis and selection with cis-hydroxy-L-proline- 

^c e 1 Ved by tr c mSkcti ° n wifll the human K^" 1 e^ne. However, re-expression' 
ii transaction of a cloned MARCKS cDNA into the C-l 1 ras revertant 

cells was not sufficient to induce retransformation. In fact, no significant difference in 
sensitivity to mitogenic stimulation by phorbol esters was observed among several cell 
lines expressing W1 dely varying levels of MARCKS. This evidence argues against a 
direct role for MARCKS in mitogenic signaling. However, the strong correlation between 
attenuation of MARCIS expression and phenotypically dominant ras reversion suggests 
that a common negative regulatory mechanism might be responsible for both effects 
presenting a potentially useful strategy for identifying factors involved in transducing the 
ras signal. b 

PMID: 8437859 [PubMed - indexed for MEDLINE] 
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[Expression of human telomerase reverse transcriptase in cervix cancer 
and its significance] 

[Article in Chinese) 

XLL, ZhuT, Wu P . Xu Huang L LjJKZ, Lu YP , Ma D . 

Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, 
Huazhong University of Science and Teclinology, Wuhan 430030, China. 

OBJECTIVE: To investigate the expression of humantelomerase reverse transcriptase 
(hTERT) mRNA and protein in cervix cancer, cervical intraepithelial neoplasia (C1N) 
and normal cervix. METHODS: Expression of hTERT mRNA and the other two subunits 
of telomerase, human telomerase RNA component (hTR), human telomerase-associated 
protein (hTP 1) was determined by RT-PCR in 3 cervix cancer cell lines, 2 diploid cell 
lines, 38 cases of cervix cancer, 16 cases of CIN and 20 cases of normal cervix. 
Telomerase activity was also examined by telomeric repeat amplification protocol 
■ enzyme-linked immunosorbent assay (TRAP-ELISA). Expression of hTERT protein was 
detected in all the cell lines and 101 cases of paraffinized cervix tissue sections. 
RESULTS: hTERT mRNA expression was detected in all of the three cervix cancer cell 
lines, 81.6% of cervix cancer, 37.5% of CIN, 5.0% of normal cervix, while in neither of 
the two diploid cell lines. The other two subunits of telomerase were prevalently 
expressed in all of the cell lines and most cervix tissues. There was a strong correlation 
between hTERT mRNA expression and telomerase activity. Irnmunostaining also 
revealed that hTERT protein was expressed in all three cervix cancer cell lines, 65.5% of 
cervix cancer, 28.0% of CIN and 4.8% of normal cervix. CONCLUSION: Up-regu!ation 
of hTERT may play an important role in the development of CIN and cervix cancer, 
hTERT could be used as an early diagnostic biomarker for cervix cancer. 
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Abstract 



Background 



Prostate stem cell antigen (PSCA) is a recently defined hornologue of the 
Thy-l/Ly-6 family of glycosylphosphatidylinositol (GPl)-anchored cell 
surface antigens. The purpose of the present study was to examine the 
expression status of PSCA protein and mRNA in clinical specimens of human 
prostate cancer (Pea) and to validate it as a potential molecular target for 
diagnosis and treatment of Pea. 



Materials and Methods 

Jmmunohistochemical (IMC) and in situ hybridization (1SH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections 
from 20 benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial 
neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was 
semiquantitative^ scored by assessing both the percentage and intensity of 
PSCA-positive staining cells in the specimens. Then compared PSCA 
expression between BPH, PJN and Pea tissues and analysed the correlations 



ht t p:// www . wjso . c om/c on t en 1/2/ 1/13 



5/25/2006 



World Journal of Surgical Om ,y | Full text | Prostate stem cell antigen (PS expression in ... Page 2 of 12 

of PSCA expression level with pathological grade, clinical stage and 
progression to androgen-independence in Pea. 



Results 

In BPH and low grade PIN, PSCA protein and mRNA staining were weak or 
negative and less intense and uniform than that seen in HGPIN and- Pea. 
There were moderate to strong PSCA protein and mRNA expression in 8 of 
11 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by 
IHC and ISH analyses, with statistical significance compared with BPH 
(20%) and low grade PIN (22.2%) samples (p < 0.05, respectively). The 
expression level of PSCA increased with high Gleason grade, advanced stage 
and progression to androgen-independence (p < 0.05, respectively). In 
addition, IHC and ISH staining showed a high degree of correlation between 
PSCA protein and mRNA overexpression. 

Conclusions 

Our data demonstrate that PSCA as a new cell surface marker is 
overexpressed by a majority of human Pea. PSCA expression correlates 
positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and 
androgen-independence, and speculatively with prostate carcinogenesis. 
PSCA protein overexpression results from upregulated transcription of PSCA 
mRNA. PSCA may have prognostic utility and may be a promising molecular 
target for diagnosis and treatment of Pea. 



Outline Introduction 



Results 
iino interests 



Abstract Prostate cancer (Pea) is the second leading cause of cancer-related death in 
niroduaion American men and is becoming a common cancer increasing in China. 
-,d tr.r.-.fh.^ . Des P ite recently great progress in the diagnosis and management of 

localized disease, there continues to be a need for new diagnostic markers 
that can accurately discriminate between indolent and aggressive variants of 
Pea. There also continues to.be a need for the identification and 
characterization of potential new therapeutic targets on Pea cells. Current 
References diagnostic and therapeutic modalities for recurrent and metastatic Pea have 
been limited by a lack of specific target antigens of Pea. 

Although a number of prostate-specific genes have been identified (i.e. 
prostate specific antigen, prostatic acid phosphatase, glandular kallikrein 2), 
the majority of these are secreted proteins not ideally suited for many 
immunological strategies. So, the identification of new cell surface antigens 
is critical to the development of new diagnostic and therapeutic approaches 
to the management of Pea. 

Rerter RE et al {;!] repotted the identification of prostate stem cell antigen 
(PSCA), a cell surface antigen that is predominantly prostate specific. The 
PSCA gene encodes a 123 amino acid glycoprotein, with. 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs to a member of 
the Thy- j/Ly-6 family and is anchored by a glycosylphosphatidylinositol 
(GPJ) linkage. mRNA in situ hybridization (ISH) localized PSCA expression in 
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normal prostate to the basal cell epithelium, the putative stem cell 
compartment of prostatic epithelium, suggesting that PSCA may be a 
marker of prostate stem/progenitor cells. 

In order to examine the status of PSCA protein and mRNA expression in 
human Pea and validate it as a potential diagnostic and therapeutic target ■ 
for Pea, we used immunohistochemistry (IHC) and in situ hybridization 
(ISH) simultaneously, and conducted PSCA protein and mRNA expression 
analyses in paraffin-embedded tissue specimens of benign prostatic 
hyperplasia (BPH, n = 20), prostate intraepithelial neoplasm (PIN, n = 20) 
and prostate cancer (Pea, n = 48). Furthermore, we evaluated the possible 
correlation of PSCA expression level with Pea tumorigenesis, grade, stage 
and progression to androgen-independence. 
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Tables 



table i 

Correlation of PSCA 
expression with Gleason 
score 



Table 2 

Correlation of PSCA 
expression with clinical 
stage 



Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80 
patients of 57-84 years old by prostatectomy, transurethral resection of 
prostate (TURP) or biopsies. The patients were classified as 20 cases of BPH, 
20 eases of PIN, 40 cases of primary Pea, including 9 patients with recurrent 
Pea and a history of androgen ablation therapy (orchiectomy and/or 
hormonal therapy), who were referred to as androgen-independent prostate 
cancers. Eight specimens were harvested from these androgen-independent 
Pea patients prior to androgen ablation treatment. Each tissue sample was 
cut into two parts, one was fixed in 10% formalin for 1HC and the other 
treated with 4% paraformaldehyde/0.3 M PBS PH 7 A in 0.3% DEPC for 3 h 
for ISH analysis, and then embedded in paraffin. All paraffin blocks 
examined were then cut into 5 nm sections and mounted on the glass slides 
specific for 1HC and ISH respectively in the usual fashion. H&E-stained 
section of each Pea was evaluated and assigned a Gleason score by the 
experienced urological pathologist at our institution based on the criteria of 
Gleason score (2]. The Gleason sums are summar ized in Table :! . Clinical 
staging was performed according to Jewett-whitmore-prout staging system, 
as shown in Table 2. In the category of PIN, we graded the specimens into 
two groups, i.e. low grade PIN (grade J - II) and high grade PIN (HGPIN, 
grade III) on the basis of literatures [3,4]. 

Immunohistochemical (IHC) analysis 

Briefly, tissue sections were deparafftnized, dehydrated, and subjected to 
microwaving in 10 mmol/L citrate buffer, PH 6.0 (Boshide, Wuhan, China) in 
a 900 W oven for 5 min to induce epitope retrieval. Slides were allowed to 
cool at room temperature /or 30 min. A primary mouse antibody specific to 
human PSCA (Boshide, Wuhan, China) with a 1:100 dilution was applied to 
incubate with the slides at room temperature for 2 h. Labeling was detected 
by sequentially adding biotinylated secondary antibodies and strepavidin- 
peroxidase, and localized using 3,3 r -diaminobenzidine reaction. Sections 
were then counterstained with hematoxylin. Substitut ion of the primary 
antibody with phosphate- buffered-saline (PBS) served as a negative- 
staining control. 
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mRNA in situ hybridization (ISH) 

Five-pm-thick tissue sections were deparaffinized and dehydrated, then 
digested in pepsin solution (4 mg/ml in 3% citric acid) for 20 min at 37.5°C, 
and further processed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, Wuhan, China) were 
hybridized to the sections at 48°C overnight. The posthybridization wash 
with a high stringency was. performed sequentially at 37°C in 2 x standard 
saline citrate (SSC). for 10 min, in 0.5 x 5SC for 15 min and in 0.2 x SSC 
for 30 min. The slides were then incubated to biotinylated mouse anti- 
digoxigenin antibody at 37-5°C for 1 h followed by washing in 1 x PBS for 
20 min at room temperature, and then to strepavidin-peroxidase at 37.5°C 
for 20 min followed by washing in 1 x PBS for 15 min at room temperature. 
Subsequently, the slides were developed with diaminobenzidine and then 
counterstained with hematoxylin to localize the hybridization signals. 
Sections hybridized with the sense control probes routinely did not show any. 
specific hybridization signal above background. All slides were hybridized 
with PBS to substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA immunostaining 
and mRNA in situ hybridization, the same scoring manners are taken in the 
present study for PSCA protein staining by IHC and PSCA mRNA staining by 
]SH. Each slide was read and scored by two independently experienced 
urological pathologists using Olympus BX-41 light microscopes. The 
evaluation was done in a blinded fashion. For each section, five areas of 
similar grade were analyzed semiquantitative^ for the fraction of cells 
staining. Fifty percent of specimens were randomly chosen and rescored to 
determine the degree of interobserver and intraobserver concordance. There 
was greater than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopically was graded on a 
scale of 0 to 3+ with 3 being the highest expression observed (0, no 
staining; 1 + , mildly intense; 2-I-, moderately intense; 3+, severely intense). 
The staining density was quantified as the percentage of cells staining 
positive for PSCA with the primary antibody or hybridization probe, as 
follows': 0 = no staining; 1 = positive staining in <25% of the sample; 2 = 
positive staining in 25%-50% of the sample; 3 = positive staining in >50% 
of the sample. Intensity score (0 to 3+) was multiplied by the density score 
(0-3) to give an overall score of 0-9 [J ,5]. In this way, we were able to 
differentiate specimens that may have had focal areas of increased staining 
from those that had diffuse areas of increased staining [6]. The overall score 
for each specimen was then categorically assigned to one of the following 
groups:. 0 score, negative expression; 1-2 scores, weak expression; 3-6 
scores, moderate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PIN and 
Pea tissues were compared using the Chi-square and Student's Mest. 
Univariate associations between PSCA expression and Gleason score, clinical 
stage and progression to androgen- independence were calculated using 
Fisher's Exact Test. For all analyses, p < 0.05 was considered statistically 
significant. 
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Figures 



PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed weakly in individual 
samples of BPH. Some areas of prostate expressed weak levels (composite 
score 1-2), whereas other areas were completely negative (composite score 
0). Four cases (20%) of BPH had moderate expression of PSCA protein and 
mRNA (composite score 4-6) by IHC and ISH. In 2/20 (10%) BPH 
specimens, PSCA mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 2) in one and 
negative (composite score 0) in the other. PSCA expression was localized to 
the basal and secretory epithelial cells, and prostatic stroma was almost 
negative staining for PSCA protein and mRNA in all cases examined. 



PSCA expression in PIN 



Representatives of PSCA 
IHC and ISH staining in Pea 
(A. IHC staining, B. ISH 
staining, x200 
magnification) 



In this study, we detected weak or negative expression of PSCA protein and 
mRNA (^2 scores) in 7 of 9 (77.8%) low grade PIN and in 2 of II (18.2%) 
HGPIN, and moderate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate PSCA mRNA 
expression (6 score) was found weak staining for PSCA protein (2 score) by 
IHC. Strong PSCA protein and mRNA expression (9 score) were detected in 
the remaining 3 of 11 (27.3%) HGPIN. There was a statistically significant 
difference of PSCA protein and mRNA expression levels observed between 
HGPIN and BPH (p < 0.05), but no statistical difference reached between 
low grade PIN and BPH (p > 0.05). 



PSCA expression in Pea 



In order to determine if PSCA protein and mRNA can be detected in prostate 
cancers and if PSCA expression levels are increased in malignant compared 
with benign glands, Forty-eight paraffin-embedded Pea specimens were 
analysed by IHC and ISH. It was shown that 19 of 48 (39.6%) Pea samples 
stained very strongly for PSCA protein and mRNA with a score of 9 and 
another 21 (43.8%) specimens displayed moderate staining with scores of 
4-6 (Figure 1). In addition, 4 specimens with moderate to strong PSCA 
mRNA expression (scores of 4-9) had weak protein staining (a score of 2) 
by IHC analyses. Overall, Pea expressed a significantly higher level of PSCA 
protein and mRNA than any other specimen category in this study (p < 
0.05, compared with BPH and PIN respectively). The result demonstrates 
that PSCA protein and mRNA are overexpressed by a majority of human 
Pea. 



Correlation of PSCA expression with Gleason score in Pea 

Using the semiquantitative scoring method as described in Materials and 
Methods, we compared the expression level of PSCA protein and mRNA with 
Gleason grade of Pea, as shown in Table .1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentiation, 5-7 scores = 
moderate-differentiation and 8-10 scores = poor-differentiation [/"]. 
Seventy- two percent of Gleason scores 8-10 prostate cancers had very 



lit t p : //www . wj s o. c om/c on t c n 1/2/ 1 / 1 3 



5/25/2006 



■ World Journal of Surgical One y | Full text | Prostate stem cell antigen (PS expression in ... Page 6 of 12 



strong staining of PSCA compared to 21% with Gleason scores 5-7 and 17% 
with 2-4 respectively, demonstrating that poorly differentiated Pea had 
significantly stronger expression of PSCA protein and mRNA than 
moderately and well differentiated tumors (p < 0.05). As depicted in Figure 
1, IHC and ISH analyses showed that PSCA protein and mRNA expression in 
several cases of poorly differentiated Pea were particularly prominent, with 
more intense and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we showed the 
results in Table 2. Seventy-five percent of locally advanced and node 
positive cancers (i.e. C-D stages) expressed statistically high levels of PSCA 
versus 32.5% that were organ confined (i.e. A-B stages) (p < 0.05). The 
data demonstrate that PSCA expression increases significantly with . 
advanced tumor stage in human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

AM 9 specimens of androgen-independent prostate cancers stained positive 
for PSCA protein and mRNA. Eight specimens were obtained from patients 
managed prior to androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the strongest staining 
category (score = 9), compared with three out of eight (37.5%) of patients 
with androgen-dependent cancers (p < 0.05). The results demonstrate that 
PSCA expression increases significantly with progression to androgen- 
independence of human Pea. 

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates 
significantly with increasing grade, worsening stage and progression to 
androgen-independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

Jn all 38 specimens surveyed herein, we compared the results of PSCA IHC 
staining with mRNA ISH analysis. Positive staining areas and its intensity 
and density scores evaluated by IHC were identical to those seen by ISH in 
79 of 88 (89.8%) specimens (18/20 BPH, J 9/20 PIN and 42/48 Pea 
respectively). Importantly, 27/27 samples with PSCA rnRNA composite 
scores of 0-2, 32/36 samples with scores of 3-6 and 22/24 samples with a 
score of 9 also had PSCA protein expression scores of 0-2, 3-6 and 9 
respectively. However, in 5 samples with PSCA mRNA overall scores of 3-6 
and in 2 with scores of 9 there were less or negative PSCA protein 
expression (i.e. scores of 0-4), suggesting that this may reflect 
posttranscriptional modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data demonstrate that 
the results of PSCA immunostaining were consistent with those of mRNA 
ISH analysis, showing a high degree of correlation between PSCA protein 
and rnRNA expression. 
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Outline Discussion 



PSCA is homologous to a group of ceil surface proteins, that mark the 
earliest phase of hematopoietic development. PSCA mRNA expression is 
prostate-specific in normal male tissues and is highly up-regulated in both 
androgen-dependent and-independent Pea xenografts (LAPC-4 tumors). We 
hypothesize that PSCA may play a role in Pea tumorigenesis and 
progression, and may serve as a target for Pea diagnosis and treatment. In 
this study, IHC and ISH showed that in general there were weak or absent 
PSCA protein and mRNA expression in BPH.and low grade PIN tissues. 
However, PSCA protein and mRNA are widely expressed in HGPIN, the 
putative precursor of invasive Pea, suggesting that up-regulation of PSCA is 
an early event in prostate carcinogenesis. Recently, Reiter RE et al [1], 
using ISH analysis, reported that 97 of 118 (82%) HGPIN specimens stained 
strongly positive for PSCA mRNA. A very similar finding was seen on mouse 
PSCA (mPSCA) expression in mouse HGPIN tissues by Tran C. P et al [S]. 
These data suggest that PSCA may be a new marker associated with 
transformation of prostate cells and tumorigenesis. 

We observed that PSCA protein and mRNA are highly expressed in a large 
percentage of human prostate cancers, including advanced, poorly 
differentiated, androgen-independent and metastatic cases. Fluorescence- 
activated cell sorting and confocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in Pea cells [9]. Our 
IHC expression analysis of PSCA shows not only cell surface but also 
apparent cytoplasmic staining of PSCA protein in Pea specimens (Figure I). 
One possible explanation for this is that anti-PSCA antibody can recognize 
PSCA peptide precursors that reside in the cytoplasm. Also, it is possible 
that the positive staining that appears in the cytoplasm is actually from the 
overlying cell membrane [5]. These data seem to indicate that PSCA is a 
novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression correlates with high 
grade (i.e. poor differentiation), increased tumor stage and progression to 
androgen-independence of Pea. These findings support the original IHC 
analyses by Gu 2 et al [9], who reported that PSCA protein expressed in 
94% of primary Pea and the intensity of PSCA protein expression increased 
with tumor grade, stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al [30], in which the 
significant. association between high PSCA expression and adverse 
prognostic features such as high Gleason score, seminal vesicle invasion and 
capsular involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recurrence, clinical 
progression or survival of Pea. Hara H et al [1 7] used RT-PCR detection of 
PSA, PS MA and PSCA in J ml of peripheral blood to evaluate Pea patients 
with poor prognosis. The results showed that among 58 PCa patients, each 
PCR indicated the prognostic value in the hierarchy of PSCA>PSA>PSMA RT- 
PCR, and extraprostattc cases with positive PSCA PCR indicated lower 
disease-progression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic factor. Dubey P et al 
[3 2] reported that elevated numbers of PSCA + cells correlate positively 
with the onset and development of prostate carcinoma over a lonq time 
span in the prostates of the TRAMP and PTEN +/- models compared with its 



Abstract 
Introduction 
Materials and methods 
Results 
Discussion' 
Competing interests 
References 



littp://\vww.\vjS(>.c*orn/contnit/2/I/I3 



5/25/2006 



World Journal of Surgical Oni ,y | Full text | Prostate stem cell antigen (PS /expression in ... Page 8 ofl 2 



normal prostates. Taken together with our present findings, in which PSCA 
is overex pressed from HGPIN to almost frank carcinoma, it is reasonable 
and possible to use increased PSCA expression level or increased numbers 
of PSCA-positive cells in the prostate samples as a prognostic marker to 
predict the potential onset of this cancer. These data raise the possibility 
that PSCA may have diagnostic utility or clinical prognostic value in human 
Pea. 

The cause of PSCA overexpression in Pea is not known. One possible 
mechanism is that it may result from PSCA gene amplification. In humans, 
PSCA is located on chromosome 8q24. 2 [1], which is often amplified in 
metastatic and recurrent Pea and considered to indicate a poor prognosis 
[13-15]. Interestingly, PSCA is in close proximity to the c-myc oncogene, 
which is amplified in >20% of recurrent and metastatic prostate cancers 
[16,17]. Reiter RE et al [18] reported that PSCA and MYC gene copy 
numbers were co-amplified in 25% of tumors (five out of twenty), 
demonstrating that PSCA overexpression is associated with PSCA and MYC 
coampJification in Pea. Gu Z et al [9] recently reporteted that in 102 
specimens available to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had identically positive areas 
of PSCA protein and rnRNA expression. Taken together with our findings, in 
which we detected moderate to strong expression of PSCA protein and 
mRNA in 34 of 40 (85%) Pea specimens examined simultaneously by.lHC 
and ISH analyses, it is demonstrated that PSCA protein and mRNA 
overexpressed in human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its mRNA. 

At present, the regulation mechanisms of human PSCA expression and its 
biological function are yet to be elucidated. PSCA expression may be 
regulated by multiple factors [13]. Watabe T et al [li 1 ] reported that 
transcriptional control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be regulated or 
mediated through cell-cell contact and protein kinase C (PKC) [20]. 
Homologues of PSCA have diverse activities, and have themselves been 
involved in carcinogenesis. Signalling through SCA-2 has been demonstrated 
to prevent apoptosis in immature thymocytes [2 J ]. Thy-1 is involved in T 
cell activation and transducts signals through src-like tyrosine kinases [22]. 
Ly-6 genes have been implicated both in tumorigenesis and in cell-cell 
adhesion |2.": -25]. Cell-cell or cell-matrix interaction is critical for local tumor 
growth and spread to distal sites. From its restricted expression in basal 
cells of normal prostate and its homology to SCA-2, PSCA may play a role in 
stem/progenitor cell function, such as self-renewal (i.e. anti-apoptosis) 
and/or proliferation [)]. Taken together with the results in the present 
study, we speculate that PSCA may play a role in tumorigenesis and clinical 
progression of Pea through affecting cell transformation and proliferation. 
From our results, it is also suggested that PSCA as a new cell surface 
antigen may have a number of potential uses in the diagnosis, therapy and 
clinical prognosis of human Pea. PSCA overexpression in prostate biopsies 
could be used to identify patients at high risk to develop recurrent or 
metastatic disease, and to discriminate cancers from normal glands in 
prostatectomy samples. Similarly, the detection of PSCA-overexpressing 
cells in bone marrow or peripheral blood may identify and predict metastatic 
progression better than current assays, which identify only PSA- positive or 
PSMA- positive prostate cells. 
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In summary, we have shown in this study that PSCA protein and mRNA are 
maintained in expression from HGPIN through all stages of Pea in a majority 
of cases, which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell differentiation), 
advanced stage and androgen-independent progression. PSCA protein 
overexpression is due to the upregulation of its mRNA transcription. The 
results suggest that PSCA may be a promising molecular marker for the 
clinical prognosis of human Pea and a valuable target for diagnosis and 
therapy of this tumor. 
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Gene expression signatures and biomarkers of noninvasive and invasive 
breast cancer cells: comprehensive profiles by representational difference 
analysis, microarrays and proteomics 
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We have characterized comprehensive transcript and 
proteomic profiles of cell lines corresponding to normal 
breast (MCF10A), noninvasive breast cancer (MCF7) and 
invasive breast cancer (MDA-MB-231). The transcript 
profiles were first analysed by a modified protocol for 
representational difference analysis (RDA) of cDNAs 
between MCF7 and MDA-MB-231 cells. The majority of 
genes identified by RDA showed nearly complete con- 
cordance with microarray results, and also led to the 
identification of some differentially expressed genes such 
as lysyl oxidase, copper transporter ATP7A, EphB6, 
RUNX2 and a variant of RUNX2. The altered transcripts 
identified by microarray analysis were involved in cell-cell 
or cell-matrix interaction, Rho signaling, calcium home- 
ostasis and copper-binding/sensitive activities. A set of 
nine genes that included GPCR11, cadherin 11, annexin 
Al, vimentio, lactate dehydrogenase B (upregulated in 
MDA-MB-231) and GREB1, S100A8, amyloid p pre- 
cursor protein, claudin 3 and cadherin 1 (downregulated in 
MDA-MB-231) were sufficient to distinguish MDA-MB- 
231 from MCF7 cells. The downregulation of a set of 
transcripts for proteins involved in cell-cell interaction 
indicated these transcripts as potential markers for 
invasiveness that can be detected by methylation-specific 
PCR. The proteomic profiles indicated altered abundance 
of fewer proteins as compared to transcript profiles. 
Antisense knockdown of selected transcripts led to 
inhibition of cell proliferation that was accompanied by 
altered proteomic profiles. The proteomic profiles of 
antisense transfectants suggest the involvement of pepti- 
dyl-prolyl isomerase, Raf kinase inhibitor and 80kDa 
protein kinase C substrate in mediating the inhibition of 
cell proliferation. 

Oncogene (2006) 25, 2328-2338. doi: 1 0.1 03 8/sj. one. 1209265; 
published online 28 November 2005 
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Introduction 

The transformation of a normal cell into a cancer cell 
has been correlated to altered expression of a variety of 
genes (Perou ei al, 2000; Becker et al, 2005). The 
expression of some of these genes is a direct result of 
sequence mutation, whereas other changes occur due to 
alterations in gene products that participate in specific 
pathways. The changes in gene expression have been 
routinely characterized by classical subtraction hybridi- 
zation and differential display approaches (Cerosaletti 
et al, 1995; Alpan et al, 1996). With the availability 
of the human genome sequence and sequences for a 
number of other model organisms, traditional methods 
have largely been replaced by gene microarrays 
(Khan et al, 2001). These analyses have been used to 
characterize the molecular basis of a variety of diseases 
including cancer. A comprehensive analysis of a large 
number of cancer cell lines allowed clustering of genes 
into groups based on their * expression patterns in 
phenotypically related cell lines (Khan et al, 2001; 
Dan et al, 2002; Rosenwald et al, 2002; van't Veer 
et al, 2002). The results of profiling experiments 
indicated expression of specific gene clusters in cell lines 
that have the same origin or have arisen from the same 
organ (Ross et al, 2000). A complementary approach 
that has been used in limited ways is proteomics. 
Proteomics scores for changes in different proteins 
and peptides in cells with characteristic pheno- 
typic differences. However, a comparative analysis of 
transcripts and proteins to establish a relationship 
between transcript changes and protein levels, has not 
yet become routine. ) 

Although expression profiling of tumor tissue and its 
comparison with normal tissue, in principle, is most 
appropriate to obtain the genetic signatures of a tumor 
type, such comparisons have not been free of attendant 
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complications. These complications arise due to hetero- 
geneity of tumor specimens wherein any cell type- 
specific changes are likely to be masked by other cell 
types that constitute the tumor specimen. For this 
reason, well-characterized cell lines established 
from tumor tissue may prove more informative and 
have been considered useful by cancer researchers. 
Comparing gene profiles between cell lines has the 
potential to reveal genes that could be causative for 
the phenotype and other genes that can serve as tumor 
biomarkers. 

Our investigations are aimed at designating a subset 
of transcripts that could distinguish a normal breast cell 
from a breast cancer cell and help to piredict tumorigenic 
or metastatic potential of a transformed cell. We 
describe here transcript and proteomic profiles of a 
normal breast cell line, a tumorigenic but noninvasive 
breast carcinoma cell line and an invasive breast 
carcinoma cell line, and summarize them as a set of 
candidate biomarkers or targets for therapeutic inter- 
vention. The comparison of transcript profiles with 
proteomic profiles demonstrated that altered proteins 
were not always represented in the microarray desig- 
nated profiles and vice versa. Furthermore, we have 
targeted five transcripts that were upregulated in MCF7 
cells for investigating their role in cell proliferation 
pathways. The proteomic profiles have revealed that 
inhibition of cell proliferation by antisense knockdown 
was mediated by a specific set of proteins. 



Results 

Representational difference analysis 
As described in the Materials and methods section, 
RDA was performed by using cDNAs from MCF7 and 
MDA-MB-231 as tester/driver or driver/tester combina- 
tion. The difference product in the first case represents 
the genes that are either upregulated in or specific to 
MCF7. On the other hand, the difference product of 
MDA-MB-231 (tester) and MCF7 (driver) hybridiza- 
tion resulted in the isolation of cDNAs that are either 
upregulated in or specific to MDA-MB-231. The initial 
linkers used in this protocol had internal Bgftl sites. 
One strand of the linker was used to amplify both the 
tester and driver cDNAs after linkers had been ligated 
to cDNAs. After removal of linkers from amplified 
cDNAs by digestion with BglU, a dephosphorylated 
BgHJ adaptor was ligated to tester DNA. The BglU 
adaptor had an internal EcoKl site. The difference 
product was digested with EcoKl and cloned in pBlue- 
Script vector. The cloning efficiency of the difference 
product was very low (5 x 10 4 c.f.u.///g of DNA). The 
low efficiency of cloning is attributed to a substantial 
fraction of amplified DNA product that is refractory to 
restriction digestion. The sequencing of a set of 100 
clones each from the difference libraries revealed 50 
different kinds of clones. The majority of these 
sequences were short fragments and represented either 
3' regions or internal fragments of transcripts. A 
summary of these clones is presented in Table 1 . The 
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involvement of the majority of these cDNAs is well 
characterized either in tumorigenesis or in metastasis. 
The phenotypic characteristics of MCF-7 and MDA- 
MB-231 ideally match with the biological significance of 
these genes. The alterations in transcripts for Rho 
signaling proteins, Ca 2+ binding/requiring proteins, 
tight junctions/anchoring junctions/gap junctions, cop- 
per binding or sensitive proteins, and RUNX2 are 
particularly noteworthy. 

The differential expression of a representative number 
of RDA clones was validated by semiquantitative PCR. 
As shown in Figure 1, these transcripts were either 
specific to or upregulated in the cell line that was used as 
a tester. Such analyses demonstrated that more than 
90% of the clones were differentially expressed. The 
abundance of transcripts and the results of RT-PCR 
were also confirmed by Northern blotting (Figure 2). 
The pattern of hybridization clearly indicates that all 
these transcripts showed differential expression in 
MCF7 and MDA-MB-231 cells that were used as 
driver/tester combinations for the RDA. 



Gene microarrays 

After obtaining preliminary molecular signatures of 
MCF7 and MDA-MB-231 cells by RDA, we used 
Affymetrix gene micoarrays to expand the above 
analysis to identify a comprehensive set of transcripts 
that is deregulated in invasive breast carcinoma cells. 
The comparisons of cell lines on the basis of transcripts 
that are either present or absent as shown in Figure 3 
revealed that a set of 123 genes distinguishes MDA-MB- 
231 cells from MCF7 and MCF10A. These genes can be 
classified by their involvement in functional classes such 
as transcription, signal transduction, cell adhesion, cell 
cycle, metabolism, transport, response genes and devel- 
opment (Figure 4). The majority of these genes 
participated in the process of signal transduction 
followed by transcription, cell adhesion and metabo- 
lism, respectively. A few transcripts in these classes were 
tested by real-time RT-PCR to confirm their altered 
abundance. The selected transcripts showed changes 
ranging between two- and 10-fold, 11- and 20-fold and 
greater than 20-fold, and were in close agreement with 
the results of microarray analysis.. The qualitative 
pattern of change observed in microarrays analysis 
was readily reproduced by real-time or semiquantitative 
RT-PCR for all transcripts tested. 

The number of altered transcripts was over 1000 
based on a change of twofold or greater, and a majority 
of these genes show changes varying between two- and 
fourfold (Figure 5). Interestingly, with all comparisons 
combined, there were 21 genes downregulated more 
than 50-fold and 55 genes that were upregulated more 
than 18-fold when specific cell line pairs were compared 
(Figure 5). The transcripts that represent the extremes of 
upregulated and downregulated scale can allow distinc- 
tion between MCF7 and MDA-MB-231 cells. These 
transcripts include GPCR11, cadherin 11, annexin 
Al, vimentin, lactate dehydrogenase B (upregulated 
in MDA-MB-231) and GREB1, S100A8, amyloid 
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Table 1 Differentially expressed transcripts identified by RDA 



Upregulated in MDA-MB-231 



Downregulated in MDA-MB-231 



Extracellular matrix/matrix-crosslinking proteins 
LOX 

Laminin p 1 

Collagen VI-a 1 „^ 

Calcium-binding proteins 
Reticulocalbin 1 
S100A8 
Cullin 5 

Transcription factors/promoter-binding proteins 
RUNX2 
c-Jun 
Fra-1 

Cell-cell adhesion/ cell-surface receptor proteins: 
Cdhll 

CYR6I \ 
MHC class II antigen y chain 
Pro tease-activated receptor- 1 
Protease-activated receptor-4 

ATPase/GTPase and signal transduction proteins 
ATP7a 
Caveolin 2 

AXL receptor tyrosine kinase 
Rho GEF 3 
Rho/Rac GEF 18 
P21-Rac2 

Metalloproteases and MMP inhibitor proteins 
TIMP-2 
MT1-MMP 

Stress-response proteins 

Dual specificity phosphatase (DUSP) 

Cytoskeletal component and binding proteins 
Moesin 
Vimentin 
Filamin B 

Cell- cycle regulation and growth/ 'differentiation/ 'apo ptosis proteins 
Cyclin BI 
Cyclin E 
Cyclin A2 
Bcl2-like 1 protein 



Secreted proteins and growth factors 
Milk fat globule protein 
TGF-a 

SMAD-specific E3 ub Hgase 2 

Miscellaneous 
Prion protein 



Calcium-binding proteins 
Calgranulin B 



Transcription factorslpromoter-binding proteins: 
Chromosome 4 ORF 
Estrogen receptor 1 
RUNX2 variant (exon 8 deleted) 

Cell-cell adhesion I cell-surface receptor proteins 
Claudin 3 

Amyloid /? precursor protein 

Triose phosphate isomerase 

Plakoglobin 

Cdh 1 

Cdh 3 

Annexin A9 

RAR-a 

Connexin31 

ATPase/GTPase and signal transduction proteins 

RhoD 

RhoB 

TGF-/? Rl 
Stress-response proteins 

Protein kinase HI 1 

Cytoskeletal component and binding proteins 
Keratin 18 
Tubulin S 1 

Microtubule-associated protein t 

Cell-cycle regulation and growth/ differ entiation/apoptosis proteins 
S100A13 
SIOOC 

Aurora kinase AIK2 
Nucleosidediphosphate kinase 

Secreted proteins and growth factors 

Trefoil factor 3 (TFF3) 

Trefoil factor I (TFF1) 

Four and a half LIM domain 1 

Solute carrier family 16 SLC16A6 
DNA replication 

DNA replication complex GINS-PSF2 
Miscellaneous 

Serine protease inhibitor type 1 (SPINT1) 

Human homolog of Xenopus protein XAG 

Hypothetical protein FLJ22222 

Hypothetical protein 20171 

Hypothetical protein MGC3265 



precursor protein, claudin 3 and cadherin 1 (down- 
regulated in MDA-MB-231). The distinction between 
MCF7 and MCF10A may be made based on keratin 19, 
serine protease, amyloid /? precursor, neuropeptide Y 
receptor Yl (upregulated in MCF7) and caldesmon, 
annexin Al, epithelial membrane protein 1, S100A2, 



keratin 1 5 (downregulated in MCF7). Likewise, 
MDA-MB-231 cells differ from MCF10A in vimentin, 
epithelial membrane protein 3, cadherin 11, GPCR 
116, collagen type XIII a 1, Bcl2-associated athanogene 
2 (upregulated in MDA-MB-231) and keratin 15, 
cystatin A, cadherin 1, CD24, calcium-activated chlor- 
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ide channel, SI OOP, GPCR 87 (downregulated in 
MDA-MB-231). Thus, a small subset of transcripts 
may serve as an accurate signature of these cell lines. 
Several of these gene products have been shown to 
participate in tumorigenesis and invasiveness of breast 
carcinoma cell lines. 

The invasiveness phenotype of MDA-MB-231 cells 
specifically relates to changes in the following functional 
classes: (a) cell adhesion molecules, (b) Ca 2+ requiring, 
Ca 1 ^ binding or Ca 2+ regulatory genes, (c) copper- 
sensitive or copper-transporting proteins and (d) specific 
regulatory proteins of Rho signaling. Among these 
functional groups, 23 transcripts involved in cell-cell 
or cell-matrix interactions are underexpressed in MDA- 



MB-231 cells and 21 transcripts were overexpressed in 
this cell line (Table 2). The comparison of Ca 2+ - 
requiring/binding genes indicated downregulation of 
26 transcripts and upregulation of 26 transcripts in 
invasive cells as compared to noninvasive cells (Table 3). 
While Ca 2 + homeostasis is extensively investigated in 
human cancers, copper homeostasis is an underexplored 
area. The alterations in copper homeostasis in breast 
carcinoma cells were reflected by changes in trans- 
cripts corresponding to a variety of copper-binding or 
copper-sensitive proteins/enzymes (Tables 1 and 4). The 
deregulation of Rho signaling was evident from changes 
in various proteins involved in this pathway (Table 4). 
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Figure 1 Semiquantitative evaluation of selected transcripts, (a) 
RNA was isolated from confluent culture dishes containing 
MCF10A (lanes 1 and 4), MCF7 (lanes 2 and 5) or MDA-MB- 
231 (lanes 3 and 6) cells. The amount of RNA was first determined 
spectrophometrically. Equal amounts of RNA, as determined by 
absorbance at 260 nm, were amplified with primers specific to actin 
gene for 18 cycles (lanes 1-3) or 20 cycles (lanes 4-6). The lane 
containing size markers is labeled as M. (b) A set of primers 
corresponding to caveolin 2 (lanes I and 2), TGF-* (lanes 3 and 4), 
Moesin (lanes 5 and 6), LOX (lanes 7 and 8), Axl receptor (lanes 9 
and 10), RhoD (lanes II and 12),S100A13 (lanes 13 and 14),TFF3 
(lanes 15 and 16) and Claudin 3 (lanes 17 and 18) were amplified 
for 32 cycles. Lanes 1, 3, 5, 7, 9, 12, 14, 16 and 18 represent 
amplified products from MCF7 and lanes 2, 4, 6, 8, 10, II, 13, 15 
and 17 represent MDA-MB-231 cells. The lanes containing p'cR 
products from MDA-MB-231 cells are marked with an asterisk. 
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Figure 2 Analysis of selected transcripts by Northern hybridiza- 
tion. The probes specific to CD74 (a), CYR61 (b), SPINT1 (c) and 
DUSP (d) were labeled with a 32 P nucleotide and hybridized to 
blots containing RNA from MCF10A (lane I), MCF7 (lane 2) 
and MDA-MB-231 (lane 3). The blots were washed stringently and 
developed as described. The amounts of RNA loaded were 
normalized as in Figure I. 
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Figure 3 Comparison of cell lines based on the presence or absence of transcripts. The absence or presence of a transcript in the 
Asymetrix chip was scored by the fluorescence read-out as described in the Materials and methods section. * 
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Gene Ontology 
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; ; Present in MDAMB231 - Absent in MCF10A & MCF7 □ Absent in MDAMB231 - Present in MCF10A & MCF7 



Figure 4 Functional classes of transcripts that differentiate a cell line pair. The transcripts identified as present or absent were 
classified based on their functional importance. 
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Figure 5 Distribution of altered number of transcripts as a : 
function of fold change. The altered transcripts were categorized in 
groups based on the magnitude of change in their abundance. 



Table 2 Cell adhesion molecules altered in breast carcinoma cells 
Upregulated in MDA-MB-231 Downregulated in MDA-MB-231 



Cadherin 4 


Claudin3 


Cadherin 1 1 


Cadherin- 1 


Catenin 


Cadherin- 3 


Integrin a 6 


Cadherin- 18 


Transmembrane anchor protein 


Cadherin, LAG seven pass receptor 


Eph B2 


Down syndrome cell adhesion 


Dystonin 


Catenin-52 


Lam in in fl 1 


Eph A4 


Lam in 


Ephrin A4 


Filamin B 


Annexin A9 


Filarnin C 


Ankyrin 3 


Tailin I 


Sarcoglycan 


Butyrophilin 


Keratin 8 


Spectrin-oc 


MAP-7 


Spectrin-/? 


MAP-t 


Thrombospondin 


Plate oglobin 


Plastin 3 


Plakophilin 


Adducin 3 y 


Discoidin domain receptor 


Lam in B receptor 


Zona occludens 3 


Lam in in fl 2 


Periplakin 


Lam in A/C 


Protocadherin a 9 




Lam in in y 2 




Lam in in a 3 



Proteome analysis 

To identify altered abundance of proteins and relate it to 
transcript profiles, we characterized the protein profiles 
of MCF-10A, MCF-7 and MDA-MB-231 cells. Typi- 
cally, > 300 protein spots could be visualized in silver- 
stained gels, and there were far fewer protein spots in 
gels that were stained with Coomassie blue. The 
comparison of MCF7 or MDA-MB-231 proteins with 
MCF-10A revealed that MCF-7 had 11 unique protein 
spots, while MDA-MB-231 had 15 spots that were not 
seen in MCF-10A. These proteins were either specific to 
or upregulated in these cell lines. The identity of these 
proteins is shown in Table 5. Out of these 26 protein 
spots, only 25 yielded amino-acid sequence. As shown in 
the table, the list includes proteins involved in stress 



response, protein-tagging activities, calcium-binding and 
calcium homeostasis proteins and some regulatory 
proteins. Prominent among these changes were proteins 
involved in calcium homeostasis such as crocalbin, 
calreticulin, calcyclin and reticulocalbin. The changes in 
signaling pathways between the two cell lines were 
indicated by altered levels of Rho GDP dissociation 
inhibitor 1 , an apoptosis/differentiation regula- 
ting protein galectin, ■ Myc expression regulator far 
upstream binding protein- 1 and the microtubule reg- 
ulator protein stathmin. The translation initiation 
fabtors 5 A and 4H were also selectively upregulated in 
MDA-MB-231 cells. 
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Table 3 Calcium binding or sensitive transcripts 
Upregulated in MDA-MB-231 Downregulated in MDA-MB-231 



Reticulocalbin 
Dystonin 
Follistatin like 1 
Cullin 

Annexin A5 

EF hand domain containing-2 

Hippocalcin like-2 

LDLR 

Steroid sulfatase 

MT-actin crosslinking factor 

Inositol- 1 ,4, 5- triphosphate 

receptor 3 

Sorcin 

Guanine nucleotide-binding 

protein-y 

GAS6 

SWP-70 protein 
Jagged I 

EGF-containing fibrulin-iike 
ECM 

Transglutaminase 2 

Thrombin receptor-like 1 

Plastin 3 

FYN oncogene 

PKC-a 

Calmegin 

Calpain 

HEG homolog 

Cyr61 



S100A8 
SI00A7 
S100A13 

Tumor associated Ca 2+ 
signal transducer 
Notch homolog 3 
PKLD2 

Adenylate cyclase 
Phospholipase-C 
Chemokine ligand 12 
Ubquitin-speciflc protease 
PKLC-*f 

Ret-protooncogene 

Signal peptide-CUB domain 

Mannostdase a 

Bradykinin receptor B2 

Solute carrier family 24 

Ca 2+ channel voltage-dependent /?3 

Regulator of G protein signaling 17 

Dystrobevin-a 

Synaptogamin 1 

Matrix gla protein 

EF hand domain family member Dl 

Pit-cyclic AMP dependent 

ATPase-Ca 2+ transporting 

Calmodulin 1 

CaM kinase 



Table 4 Altered transcripts involved in copper homeostasis and Rho 
signaling 



Copper-binding proteins 



Rho signaling proteins 



LOX 

LOX-1, LOX-2 
SCO cyt oxidase-deficient 
homolog 2 
COX 17 homolog 
Metallothionein IE, IF and 2a 
Ring finger protein7 
Amiloride-binding protein 1 
Neurotrypsin/motopsin 



Rho3 

Rho/Rac GEF 18 
Rho GEF 12 

Ras related C3 botulinum toxin 
substrate 2 

Cdc 42 effector protein 3 
Rho GEF 3 

Rho GDP dissociation inhibitor ft 
RhoD 



Changes in proliferation characteristics and protein 
profiles in response to transfection with antisense \ 
constructs of selected transcripts 

We had observed significant upregulation of transcripts- 
for DNA replication complex protein GINS PSF2, 
trefoil factor 3, aurora kinase AIK2, protein kinase HI 1 
and secreted protein XAG in MCF7 cells. We reasoned 
that antisense knockdown of the above genes in MCF7 
cells might indicate pathways involved in tumorigenesis 
and invasiveness. 

MCF7 'cells were transfected with empty vector 
pCDNA3.1 or antisense constructs of the above genes. 
A semiquantitative amplification of pCDNA marker 
gene by RT-PCR confirmed the presence of the 
transfected construct in a significant proportion of the 
cell population. The transfected cells also showed a 
decrease in the target transcripts as observed by RT- 



TaWe 5 Proteomic profiles of MCF7 and MDA-MB-231 cells as 
compared to MCF10A cells 



MCF7 cells 



MDA-MB-231 cells 



Cell-cell adhesion/cell-surface 
receptor proteins 
Triose-phosphate isomerase 

Stress-response proteins 
Hsp27 

Superoxide dismutase 
Peroxiredoxin 2 



Cytoskeletal component/binding 
proteins 
Stathmin 

Cell-cycle regulation and growth/ 
differentiation/ apoptosis proteins 

Nucleoside diphosphate kinase A 

S100C 

Secreted proteins and growth factors 
Macrophage migration inhibitory 
protein 

Miscellaneous 
Cyt c oxidase VIb 
Peptidyl-prolyl cis- trans isomerase 
Ubiquitin 



Calcium-binding proteins 
Calcyclin 
Calreticulin 
Crocalbin 
Reticulocalbin 

Transcription Factors/ 
Promoter-binding proteins 

Far upstream element 

binding protein- 1 

Far upstream element 

binding protein-2 

Cell-cell adhesion/ cell-surface 
receptor proteins 
Galectin 

A TPase/GTPase/signal trans- 
duction/ trafficing proteins 

Rho GDP dissociation 

inhibitor I 

Stress-response proteins 
Hsp 70 

Peroxiredoxin 2 

Cytoskeletal component/ 
binding proteins 
Stathmin 

Miscellaneous 

Heterogeneous nuclear 
ribonucleo protein H 
eIF4H (translation) 
eIF5A (translation) 



PCR. The effects of antisense transfections were scored 
by growth characteristics of the transfectants. The cell 
proliferation was reduced between 15 and 40% when 
antisense transfectants were compared to cells trans- 
fected with empty vector. 

In order to relate decreased proliferation of transfec- 
tants to altered proteins, proteomic profiles of transfec- 
tants were compared with vector controls. The 
comparison of protein profiles of cells transfected with 
empty vector or antisense construct revealed alterations 
in several proteins for each transfectant (Table 6). These 

\ proteins included stress-response proteins, calcium- 
regulating proteins, translation factors, ubiquitin, pro- 

• teins of electron transport chain and oxidative phos- 
phorylation, signaling proteins, cytokeratins, actin and 
actin regulating proteins and general regulatory factors. 
The number of altered proteins varied between 5 and 15 
for various transfections. Peptidyl prolyl cis-trans 
isomerase, calcium -regulating proteins, SOD, galectin, 
histidine triad protein and PKC substrate were promi- 
nent among altered proteins. We performed database 
searches to identify interactors for all proteins that were 
altered in transfected cells and identified nearly 350 
proteins (data not shown). A significant number of these 
interacting proteins are involved in transcriptional 
regulation. 
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Table 6 Altered proteins in MCF7 cells after antisense knockdown of specific transcripts 
After transfection with as-trefoil factor 3(TFF3) After transfection with as-protein kinase Till 



Calcium-binding proteins 

Calmodulin 
Cell-cell adhesionj cell-surface receptor proteins 

Retinoic acid-binding protein II 
ATPase-GTPasej signal transduction/trafficing proteins 

Raf kinase inhibitor 
Stress-r espouse proteins 

Hsp27 

Peroxiredoxin 1 
Cytoskeletal component /binding proteins 

Cofilin-nonmuscle isoform 
Cell- cycle regulation and growth) differentiation! apoptosis proteins 

Chromobox protein homolog I 

Chromobox protein homolog 3 

Translationally controlled tumor protein 
Miscellaneous 

40S ribosomal protein S12 

ATP synthase D chain 

Cancer-associated Sm-Iike protein 

Cyt c oxidase polypeptide Va 

eIF5A (translation) 

Ferritin heavy chain 

His triad nucleotide- binding protein 

Histone H2B.n 

Peptidyl-prolyl isomerase 

Proteosome subunit /f-type 6 

RNA-binding protein 8A 

Thioredoxin 

Ubiquitin crossreactive protein 

After transfection with as-aurora kinase AJK2 

Calcium-binding proteins 

Calgranulin B 

Calgranulin A 
Stress-response proteins 

Hsp27 

Superoxide dismutase 
Miscellaneous 

60S acidic ribosomal protein P2 
Peptidyl-prolyl isomerase 
Ubiqui tin-cross reactive protein 



Cell-cell adhesionj cell-surface receptor proteins 
Galectin 

Retinoic acid- binding protein II 
ATPase-GTPase/ signal transduction) trajficing proteins 

PKC substrate 
Stress-response proteins 

Hsp27. 

Cytoskeletal component! binding proteins 

Actin-a, skeletal muscle - 

Cytokeratin 18 
Miscellaneous 

30H-acyI CoA dehydrogenase II 

ATP synthase A chain 

Cyt c oxidase peptide Va 

Enhancer of rudimentary homolog 

Thioredoxin 

Ubiquinol-cyt C reductase 
No match 

After transfection with as-DNA replication complex GINS PSF2 
Calcium Binding proteins: 
Calreticulin 

A TPase- G TPase /signal t ransdu c t ionftrqffi cing proteins 

14-3-3 n 

PKC substrate 
Cytoskeletal component /binding proteins: 

Cytokeratin 1 

Cytokeratin 18 

Cell-cycle regulation and & growth/ differentiation /apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

MAP/MT affinity regulator 
Miscellaneous 

ATP synthase D chain 

Peptidyl-prolyl isomerase 
. Ubiquinol-cyt c reductase 

After transfection with as-human homolog of XAG 
Calcium-binding proteins 

Calreticulin 
Stress-response proteins 

Hsp27 
Miscellaneous 

ATP synthase A chain 

Peptidyl-prolyl isomerase 

Ubiquitin crossreactive protein 

No match 



Discussion 

The results presented here validate the gene profiles 
obtained from different expression platforms ranging 
from subtractive hybridization to gene microarrays and 
proteomic analysis. The RDA protocol is powerful 
enough to yield important genes that show significant 
alterations in their expression between cell lines, and can 
lead to isolation of full-length cDNAs by using 
appropriate modifications (Baskaran et aL, 1996; Jacob 
et aL, 1997). The detection of RUNX2, variant of 
RUNX2, EphB6, prion protein, lysyl oxidase and a 
copper transporter ATP7A transcripts by RDA warrant 
specific mention. RUNX transcription factors bind 
specific motifs on target gene promoters and regulate 
gene expression leading to cell growth, proliferation and 



differentiation (Pratap et aL, 2003). RUNX2 and its 
variant have differential repression activity toward the 
promoter of the cyclin-dependent kinase inhibitor 
(p21CIPl) (Westendorf et aL, 2002). The loss of EphB6 
expression due to methylation of its promoter is related 
to invasiveness of M DA- MB -231 (Fox and Kandpal, 
2004; unpublished observations). Lysyl oxidase, a 
copper- sensitive enzyme, causes oxidative deamination 
of lysine and hydroxy lysines of collagen to aldehyde 
forms to stabilize collagen fibrils (Siegel, 1976) that are 
found in invasive breast carcinoma cells (Akiri et aL, 

2003) . The activation ofiLOX is dependent on copper 
that is internalized and then transported to trans golgi 
network by copper transporter ATP7A (Pase et aL, 

2004) , a protein mutated in Menkes disease (Moller 
et aL, 2005). Prion protein has also been reported as a 
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chaperone for copper (Jones et al, 2004). Thus, EphB6 
can serve as a marker for invasiveness, and LOX and 
ATP7A may be exploited as relevant targets for 
therapeutic intervention. 

The downregulation of junctional proteins along with 
inactivation of TIMPs as shown here is in agreement 
with other reports describing their relationship with 
invasiveness of carcinoma cells (Johnson, 1991; Kousi- 
dou et al, 2004; Shao et al, 2005) and promoter 
methylation (Costa et al, 2004). As several transcripts 
coding for junctional proteins are downregulated in 
invasive cells, we postulate that methylation-specific 
PCR can be exploited to use these transcripts as 
biomarkers of tumor cells in general and invasiveness 
in particular. The changes in cell-cell interaction 
correlate to cell phenotypes because such interactions 
influence Rho/Ras signal transduction pathways and 
vice versa (Malliri and Collard, 2003; Nagaraja and 
Kandpal, 2004; Ridley, 2004), and lend credence to the 
significance of altered transcripts for Rho and Rho 
GEFs as presented here. 

Early changes in calcium homeostasis as measured by 
calcium excretion have been reported in breast cancer 
(Campbell et al, 1983), and altered calcium signaling has 
been shown in invasive lung carcinoma cells (Amuthan 
et al, 2002). Prominent among calcium-binding proteins 
are S-100 protein, a group of intracellular messengers 
that respond to transient changes in calcium concentra- 
tion by binding to specific receptors (Marenholz et al, 
2004) and regulate cell growth, differentiation and 
motility, transcription and cell cycle. The S-100 proteins 
detected in the present study map to chromosome lq21, 
a region of genome that is frequently altered in human 
breast cancer cells (Bieche et al, 1995). Calcium ions act 
• as a second messenger in specific signaling pathways in a 
variety of cancers (Missiaen et al, 2000) and are known 
to alter calcineurin to activate transcription factors such 
as NFATc (Luo et al, 1996). 

As dictated by post-transcriptional regulation, protein 
profiles showed far fewer changes as compared to 
transcript profiles, and the knockdown of five selected 
genes in MCF7 cells produced interesting changes in 
protein profiles. These genes, namely, XAG (secretory 
Xenopus laevis protein), trefoil factors 3, human aurora2 
kinase AIK2, protein kinase Hll and DNA replication 
complex GINS PSF2, have been shown to be estrogen 
responsive, oncogenic or involved in tumorigenesis (Yu 
et al, 2001; Fletcher et al, 2003; Katoh, 2003; Warner 
et al, 2003; Takayama et al, 2003). The antisense 
constructs of these genes appeared to work as siRNAs 
as suggested by the reduction in the transcript detected 
in RT-PCR of RNA isolated from the transfected cells. 
The involvement of the above transcripts in invasive 
potential is apparent from the observed upregulation of 
calcium-binding proteins in transfected MCF7 cells, 
which is comparable to their levels in MDA-MB-231 
cells. The proteins that appear to mediate inhibition of 
proliferation in antisense-transfected cells include PKC 
substrate, Raf kinase inhibitor, histidine triad nucleo- 
tide-binding protein and peptidy-prolyl isomerase 
(Pinl). We believe histidine triad protein effects are 



most likely mediated via its interaction with ATM 
protein. Raf kinase inhibitor (Keller et al, 2004) and 
ATM (Hall, 2005) have been conclusively linked to 
transformation of cells, and the activity of Pinl has been 
related to p53-mediated signaling pathways (Mantovani 
et al, 2004; Berger et al, 2005). In this context, p53 
activation has also been hypothesized by Cu-SOD 
prion-like enzyme (Wiseman, 2005). Thus, alterations 
in copper homeostasis and p53-mediated signaling may 
be considered as a significant regulatory mechanism in 
tumorigenesis. 

In summary, we have presented here a set of 
candidate genes that can serve as biomarkers for 
tumorigenesis and invasiveness, and some of these 
markers may be used to develop DNA-based diagnostic 
tests. The alterations in transcripts for copper home- 
ostasis genes suggest copper chelation or inhibition of 
copper transporter ATP7A as potential targets for 
therapeutic application. The modulation of RUNX2 
splicing variants by chemicals that affect splicing 
machinery may also be explored as a therapeutic 
modality. The changes in EphB6 expression, if con- 
firmed in tumor specimens, may have prognostic 
significance. 



Materials and methods 

Breast cancer cell lines 

We used MCF-10A, a eel! line established from normal breast, 
and two breast carcinoma cell lines MCF-7 and MDA-MB- 
231 that vary in their in vitro and in vivo invasiveness. All cells 
were cultured at 37°C/7% C0 2 . MCF-10A cells were grown in 
1:1 DMEM:F12 media (Gibco) with 5% horse serum (Gibco), 
20 mM HEPES, lOng/ml EGF (Invitrogen), 10ml/l PenStrep- 
Glutamine (lOOOOU/ml penicillin, lOOOOjig/ml streptomycin 
and 29.2mg/ml L-glutamine), 10/ig/ml insulin (Invitrogen), 
0.1/ig/ml Cholera Toxin (Sigma) and 500ng/ml hydrocorti- 
sone (Sigma). MCF-7 and MDA-MB-231 cells were grown in 
DM EM (Gibco) supplemented with 2rnM L-glutamine (Gib- 
co), ImM sodium pyruvate (Gibco), 5 ml/1 penstrep (5000 U/ 
ml penicillin and 5O0O^g/mI streptomycin), and 10% fetal 
bovine serum (Hyclone). 

Total RNA isolation 

RNA was isolated from 85 to 95% confluent 10 cm tissue 
culture dishes using TRI reagent (Molecular Research Center 
Inc.) with slight modifications to the recommended protocol. 
Approximately 10 million cells were mixed with 1.0 ml Tri 
Reagent, the mixture was extracted with chloroform and the 
aqueous phase containing RNA was separated. The RNA was 
precipitated with isopropanol, the pellet washed sequentially 
with 80 and 300% ethanol, then dried and resuspended in 
DEPC-treated water. RNA was stored in aliquots at -70°C. 
The quality of RNA was visualized by running on a 
formaldehyde gel. The appearance of ribosomal RNA bands 
indicated that RNA was not degraded during the procedure. 
The amount of RNA was determined by its absorbance at 
260 nm. 

DNAase treatment of total RNA 

To remove DNA contamination, 20 /ig of RNA (quantified 
spectrophotometrically) was treated with 500 ng DNAase I, 
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80 U RNasin (Promega) and 1 mM MgCl 2 in Tris buffer in a 
total volume of 50 fil The reaction was carried out at 37°C for 
1 h and the DNAase inactivated by heating to 65°C for 30min. 

Representational difference analysis (RDA) 
RDA of cDNAs is a modification of genomic RDA (Lisitsyn 
et al 7 1993). We performed-RUA in the following two ways. In 
one experiment, MCF-7 cDNA was used as a driver and 
MDA-MB-231 cDNA as a tester. In the second experiment, 
MCF-7 cDNA was used as a tester and MDA-MB-231 cDNA 
as a driver. The protocol has been describer previously (Jacob 
et al. y 1997). Briefly, first-strand synthesis was carried out using 
a commercial cDNA synthesis kit as per the manufacturer's 
protocol. A linker with a Bgill site was ligated to the tester as 
well as the driver cDN A. A primer specific to one of the linker 
strands was used to PCR amplify the linker-ligated cDNAs. 
The linkers were then removed by digesting the cDNA with 
BglU and the digested cDNA was gel purified. A second set of 
unphosphorylated B%R\ adaptors was ligated to the tester 
cDNA only. The tester and driver DNAs were hybridized m a 
5 |il reaction volume at a ratio of 1:40. After hybridization, the 
ends of the tester homoduplexes were repaired with Klenow 
polymerase and 1 /d of the reaction mixture was diluted to 
100 AtL The difference product was obtained by amplifying 1 fil 
of the diluted mixture using the top strand of the ligated 
adaptor as a primer. The amplified difference product was 
digested with EcoKl and cloned in a pBlueScript vector. 
Individual clones were picked up and sequenced by Sanger's 
dideoxy chain termination method. Representative clones were 
validated by Northern analysis and semiquantitaive RT-PCR. 

Microarray analysis 

The GeneChips, Human Genome U133A 2.0, (Affymetrix, 
Santa Clara, CA, USA) used in this study contained 
approximately 22 000 probe sets corresponding to 18 400 
transcripts and variants, including 14500 well-characterized 
human genes. 

Total RNA was converted into double-stranded cDNA by 
using Superscript II (Invitrogen, Carlsbad, CA, USA) and an 
oligo-dT primer containing a heel of the T7 RNA polymerase 
promoter sequence. The reaction mixture containing double- 
stranded cDNA was extracted with phenol-chloroform, 
precipitated with ethanol, and dissolved in 12 /d RNase-free 
water. The cDNA was transcribed in vitro by using a RNA 
transcription labeling kit (Enzo Biochem, Famingdale, NY 
USA) with 6 Ml of double-stranded cDNA in the presence of 
ATP, CTP, GTP, UTP, bio-ll-CTP and bio-16-UTP. The 
biounylated RNA was purified by using an affinity column 
(Qiagen, Valencia, CA, USA) and fragmented randomly, by 
heating to 95°C in the presence of fragmentation buffer, 
between sizes of 35 and 200 bases. The GeneChips were 
hybridized overnight at 45°C in hybridization oven m a 
solution containing fragmented cRNA, control oligonucleo- 
tide B2, 20 x eucaryotic hybridization controls, herring sperm 
DNA, 'acetylated BSA and 2 x hybridization buffer- The 
GeneChips were washed and stained with streptavidm- 
phycoerythrin and the antibody in 2 x MES stain buffer, 
acetylated BSA, and optically read at a resolution of 6^m with 
a Affymetrix GeneChip scanner 3000. Affymetrix MICRO- 
ARRAY SUITE was used for initial data preparation 
(generation of ;CHP files). Normalization (quantile method) 
and calculation of signal intensities was performed with the 
software package RMA from the R project (http://www.r- 
project.org/). For every cell line, three replicates were 
performed with Affymetrix Gene Chips. The Gene Chip data 
were used for further calculations after the raw image and 



MAS5 analysis revealed a positive quality report. Ratios of 
average signal intensity (log 2) were calculated for the probe 
sets between pairs of cell lines and then converted to an 
average fold change (AFC). Statistical validation was per- 
formed on probe sets as described (Yoshida et al. t 2004). The 
statistical method used to assign P-values to the fold changes 
of gene responses is described by Yoshida et al (2004) and is a 
two-step procedure based on the Benjamini and Yekutieli 
construction of false discovery rate confidence intervals 
(FDRCI) (Reiner et al, 2003). Functional annotation of 
proteins was assigned through Gene Ontology (http://www. 
geiieontology.org) or Locuslink (http://www.ncbi.nlm.nih.gov/ 
LocusLink) classifications obtained through appropriate pub- 
lic databases. 

Quantitative RT-PCR 

RNA was reverse transcribed with Superscript II (Invitrogen, 
Carlsabad, CA, USA) RT by priming with oligodT. The 
primers specific to validated genes were synthesized from the 3' 
untranslated- region using Primer 3 software. PCR reactions 
were then performed in triplicates in an I-cycler Thennocycler 
with optical module (BioRad, Hercules, CA, USA). The 
amplified products were quantified by reading fluorescence of 
SybrGreen I (Molecular Probes, Eugene, OR, USA). Average 
fold changes were calculated by differences in threshold cycles 
(Ct) between pairs of samples to be compared. HPRT gene was 
used as a control. 

Semiquantitative RT-PCR 

The spectrophotometrically determined concentration of RNA 
was confirmed by amplifying actin message at different cycles. 
The cycling conditions that yielded proportional increment of 
amplified product was used to normalize the RNA concentra- 
tion. The normalized RNA was used as template to determine 
relative abundance of transcripts corresponding to clones 
identified by RDA experiments. The conditions were standar- 
dized in the range of cycles that yielded a PCR product for at 
least one of the pairs of compared RNAs. Such experiments 
dictated cycles between 30 and 35 to be appropriate to 
compare abundance of selected transcripts in MCF7 and 
MDA-MB-231 cells. 

Northern analysts 

The expression pattern of selected transcripts mcell lines was 
also analysed in Northern blots. RNA (20 /an), as determined 
by spectrophotometer and confirmed by actin amplification, 
was electrophoresed on a formaldehyde agarose gel. A RNA 
ladder. was used as a size marker. The RNAs were transferred 
from the gel to a Hybond nylon membrane by capillary 
transfer. The RNA was fixed onto the membrane by 
irradiation in a Stratalinker. The blot was hybridized at 65°C 
for 12-15h with a radioactive probe and the blot was 
subsequently washed with 0.1 x SSC and 1% SDS at 65°C. 
The hybridized probe was detected by autoradiography. 

Transfection of MCF-7 cells with antisense constructs 
The genes selected on the basis of their upregulation were 
cloned in antisense orientation in pCDNA3.1 vector (Inviro- 
gen). MCF-7 cells were grown to 70-80% confluence. 
Approximately 4^g of DNA was transfected into MCF-7 
cells by using Lipofectamine 2000. The transfected cells were 
grown in the presence of G418 (400/ig/ml). The transfectants 
were processed for protein isolation. A control set of cells was 
transfected with an empty pcDNA3.1. The proteins were 
analysed by two-dimensional electrophoresis, and altered 
bands were excised for mass spectrometry. 
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Protein isolation 

The cultured cells were harvested by trypsinization and 
centrifuged at 220 g for 5min at 4°Q The cell pellet was 
washed once with ice-cold 1 x PBS. The proteins were isolated 
by using a commercial kit (Bio Rad, Hercules, CA, USA). 
Briefly, pelleted cells (0.05 ml) were mixed with 0.5 ml ice-cold 
CPEB solution containing protease inhibitors cocktail 
(Roche), vortexed and stored on ice for 30inin. The cell 
suspension was passed through a syringe needle (20 gauge) for 
10-20 strokes to ensure complete cell lysis. The cytoplasmic 
protein fraction was collected by centrifugation at 100 £ for 
lOmin at 4°C. The nuclear pellet was washed once again with 
0.5 ml CPEB solution. The nuclear pellet was resuspended in 
0.75 ml PSB buffer, vortexed briefly and centrifuged at 1000# 
for lOmin at 4°C, and the supernatant containing nuclear 
protein was collected into a new tube. The samples were 
quantified using 2D Quant kit (Amersham Biosciences), 
aliquoted and stored at -80°C to prevent protein degradation. 
To reduce streaking, background staining and the other gel 
artefacts associated with substances contaminating 2D/IEF* 
samples, the samples were cleaned with 2D Clean up kit (Bio 
Rad, Hercules, CA, USA) before running on the gel. 

Two-dimensional gel electrophoresis 

The protein mixtures were separated based on isoelectric 
Doints by using commercial pre-cast pH gradient gel strips 
according to the manufacturer's instructions. The protein 
Sample (175 ^g) in 185 /xl of sample buffer (8M urea, 2% 
CHAPS, 0.2% biolytes, 3/10 ampholytes, 65 mM DTT and 
0.002% bromophenol blue) was loaded in the sample loading 
trays at the end of 1 1 cm immobilized rehydrated strips (pH 3- 
10) (Bio Rad, Hercules, CA, USA). Following isoelectric 
focusing, proteins were reduced and alkylated by successive 
15min treatments with equilibration buffer (6M urea, 0.375 M 
Tris-HCl pH 8.8, 2% SDS, 20% glycerol, 2% DTT) and 2.5% 
(W/V) iodoacetamide, respectively. Proteins were then re- 
solved in the second dimension on 8-16% gradient SDS- 
PAGE gel (Bio Rad, Hercules, CA, USA). The protein spots 
were visualized by staining with either silver stain or 



Coomassie blue stain. The gel images were compared and 
bands showing significant (greater than twofold) alterations in 
intensity were excised and processed for mass spectrometry. 
Comparisons were made between protein ly sates from MCF- 
10A, MCF-7 and MDA-MB-231 cell lines or between MCF-7 
and MCF-7 cells transfected with specific antisense constructs. 

Protein identification by enzymatic digestion followed by mass 
spectrometry 

Prior to performing trypsin digestion, the gel pieces containing 
protein spots were washed sequentially once with water and 
twice with acetonitrile. The gel pieces were then allowed to 
swell in 100 mM ammonium bicarbonate and finally washed 
with acetonitrile. The washed slices were dried in a Speed Vac 
concentrator, and subsequently incubated with 20 /d of 
Promega's autocatalysis-resistant trypsin (12.5ng//d in 50 mM 
ammonium bicarbonate and 5 mM CaCl 2> pH 8.0) overnight at 
37°C. The supernatant (5 /il) from tryptic digest was injected 
for peptide sequence analysis using on-line capillary liquid 
chromatography-electrospray ionization- tandem mass spectro- 
metry (LC-MS/MS). The front end HPLC utilized a Dionex 
(San Francisco, CA, USA) Vydac 300/im inner diameter 
x 15 mm CI 8 column. The linear acetonitrile gradient (3%/ 
min, containing 0.02% TFA) was developed using a Hewlett- 
Packard 1100 pump operating at 0.1 ml/min, and the flow was 
split before the injector such that the flow rate through the 
column was 3 /d/min. Peptides were detected at 215 nM. The in- 
line mass spectrometer was a ThermoElectron LCQ-DECA 
instrument operated in data-dependent MS/MS mode, and 
proteins were identified by searching a nonredundant protein 
database using the Sequest program- 
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Identification of androgen-regulated genes in the 
prostate cancer cell line LNCaP by serial analysis 
of gene expression and proteomic analysis 

A common therapy for nonorgan-confined prostate cancer involves androgen depriva- 
tion. To develop a better understanding of the effect of androgen on prostatic cells, we 
have analyzed gene expression changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer line LNCaP, at both RNA and protein levels. 
Changes at the RNA level induced by DHT were determined by means of serial analysis 
of gene expression (SAGE), and protein profiling was done by means of quantitative 
two-dimensional polyacryfamide gel electrophoresis. Among 123371 transcripts anal- 
yzed, a total of 28844 distinct SAGE tags were identified representing 16570 genes. 
Some 351 genes were significantly affected by DHT treatment at the RNA level 
(p < 0.05), of which 147 were induced and 204 repressed by androgen. In two indepen- 
dent experiments, the integrated intensity of 32 protein spots increased and 12 
decreased at least two-fold in response to androgen, out of a total of 1031 protein 
spots analyzed. The change in intensity for most of the affected proteins identified 
could not be predicted based on the level of their corresponding RNA. Our study pro- 
vides a global assessment of genes regulated by DHT and suggests a need for profiling 
at both RNA and protein levels for a comprehensive evaluation of patterns of gene 
expression. 



Keywords: Androgen / Prostate / LNCaP / Gene expression / Serial analysis of gene expression 
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1 Introduction 

Androgens affect numerous aspects of prostate biology 
including development, growth, and maintenance. They 
also play a critical role in tumorigenesis and progression 
of prostate cancer. Androgen deprivation is an estab- 
lished treatment modality for prostate cancer. However, 
the disease eventually progresses into a hormone refrac- 
tory cancer. Several mechanisms have been identified 
which may contribute to androgen independence: (1) 
Mutations in the androgen receptor (AR) lead to ligand- 
independent activation or promiscuity of the receptor 
[1 , 2]. The ability of the receptor to activate or repress 
downstream genes can also be affected by mutation. AR 
mutation is associated with advanced phases of prostate 
cancer [3,4]; (2) AR can be activated in a ligand-indepen- 
dent manner by specific growth factors and cytokines [5]; 
(3) AR gene amplification has been found to occur in 



28-30% of tumors that recurred post androg en-ablation 
therapy [6, 7]. Wallen ef a/. [8] have recently shown that 
one -third of locally recurrent hormone refractory prostate 
cancer contain AR gene amplification; (4) Coactivator 
amplification and core pressor down-regulation have 
been shown to increase receptor transactivation [9, 10]. 
Regardless of which pathway(s) is taken by the tumor 
cells, androgen-regulated genes may ultimately be the 
key players in the development of hormone refractory 
cancer. As a step towards a better understanding of the 
effect of androgen on gene expression, we have under- 
taken a comprehensive assessment of gene expression 
changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer cell line LNCaP. An 
important feature of our study is the parallel assessment 
of expression changes at both RNA and protein levels. 



2 Materials and methods 



Correspondence: Dr. Y. Q. Chen, Department of Pathology, 
Wayne State University, 540 E. Canfield, Detroit, Ml 48201, USA 
E-mail: yqchen@med.wayne.edu 
Fax:+1-313-5770057 

Abbreviations: DHT, dihydrotestosterone; G3PD, glyceralde- 
hyde-3- phosphate dehydrogenase; PSA, prostate specific anti- 
gen; RT-PCR, reverse transcription-polymerase chain reaction; 
SAGE, serial analysis of gene expression 



2.1 Prostate cell line and DHT treatment 

LNCaP human prostate cancer cell line (American Type Cul - 
ture Collection, Rockville, MD, USA) was cultured for 3 d in 
phenol-free RPMI 1640 supplemented with 5% charcoal- 
stripped fetal bovine serum (FBS) at 37°C. Haif of the cul- 
tures were then treated with 1 0" 9 m DHT for 24 h. Total RNA 
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and protein were extracted from untreated and DHT treated 
cells using TRIzol (Invitrogen, Carlsbad, CA, USA) reagent 
for RNA and a solubilization cocktail for proteins, consisting 
of 9 m urea, p-mercaptoethanol and 1 % NP-40. For kinetics 
experiments 1 x 10 6 cells were plated in phenol-free RPMI 
medium with 5% charcoal-stripped FBS. On day three, 
cells were treated with 5 mL of media containing 10"* m 
DHT and total RNA was extracted at different time points 
from 0-96 h. To determine the direct/indirect effect of andro- 
gen on RNA levels, cells were plated as above and treated 
with 5 ng/mL of cyclohexamide in the presence or absence 
of 1 O^m DHT for 24 h and total R NA was extracted. 

2.2 Serial analysis of gene expression 

Serial analysis of gene expression (SAGE) was performed 
as described previously [1 1] with the following modifi- 
cations: ditags were PCR amplified using biotinylated pri- 
mers and digested with Nla\\\ enzyme [12]. Concatemers 
were heated for 15 min at 65°C and chilled on ice for 
10 min before separation on an 8% polyacrylamide gel 
[13]. The concatemers were then cloned into the Sphl 
site of the pZero vector (Invitrogen). Concatenated tags 
were screened by PCR using M13 forward and M13 
reverse primers. PCR products with inserts greater than 
500 bp were isolated and sequenced with M13 forward 
primer on an automated 3700 DNA sequencer (Perkin- 
Elmer, Norwalk, CT, USA). For microSAGE, 1 of total 
RNA per tube was used for cDNA synthesis in two tubes, 
with the mRNA Capture Kit (Boehringer Mannheim, 
Indianapolis, IN, USA. cDMAs were cleaved with Mall I, 
ligated to linkers and then digested with Bsm Fl enzyme. 
The released tags were ligated, and processed for the 
rest of steps as with the standard SAGE protocol. 

2.3 SAGE data analysis 

SAGE tags were extracted using the SAGE software 
V 4.12 [11]. Tags were matched to the SAGE reliable 
map (release 10-26-2000) (http://www.ncbi.nlm.nih.gov/ 
SAGE/). Due to the fact that some tags map to multiple 
genes and some genes have multiple tags, SAGE data 
were analyzed in two different ways: (1) exclusion 
method: tags that match to multiple genes were dis-, 
carded. Only tags that match to a single gene were tabu- j 
lated and composite counts analyzed for their signifi- 1 
cance; (2) inclusion method: tags that match to multiple : 
genes were counted at 100% toward each gene. All tags 
were tabulated and composite counts analyzed for their 
statistical significance. Lists of differentially expressed 
genes (p<0.05) obtained from the exclusion and inclu- 
sion methods were compared, and finally only genes that 
have ap value < 0.05 in both lists were considered statis- 



tically significant. The total number of genes identified 
was estimated by IM m + (N um - 0.1 N um )/3.5, where N m is 
the number of genes matched to SAGE tags, N um is the 
number of SAGE tags that do not match to known genes 
or ESTs, with 10% representing the estimated sequencing 
error per SAGE tag and 3.5, the average number of tags 
per gene in the SAGE reliable map (release 10-26-2000). 

2.4 RT-PCR and real-time PCR quantification 

For reverse transcription-PCR(RT-PCR), 1 \ig of total RNA 
was reverse transcribed into cDNA. One fortieth of cDNA 
was used for PCR reaction. Samples were collected at 
different cycles and separated on a 2% agarose gel with 
ethidium bromide. Image was captured and quantified 
using NucleoVision 760 Image Workstation (Nucleotech, 
CA, USA). Amplification curves were generated. Sub- 
sequently, RTtPCR was done . at cycles within the log 
phase of amplification. 

Real-time quantification was performed in the iCycler (Bio- 
Rad, Hercules, CA USA). Briefly, one fortieth of c DNA was 
used in each reaction. Six reactions were carried out for 
each gene and three independent experiments were per- 
formed. PCR mix comprised of 1X PCR buffer, 1.5 mw 
MgCl 2 , 0.1 mw dMTP, 200 ntvt primers (listed below), 0.05 U 
platinum Taq polymerase (Invitrogen) and 0.1 x SYBR green 
(Molecular Probes, Eugene, OR, USA). PCR wascarried out 
at 94°C for 2 min, and 40 cycles of 94°C for 30 s, 55°C for 
30 s, and 72°C for 30 s. Primers used for real-time PCR 
quantification: PSA (prostate specific antigen) (Hs.1 71995, 
5'-GGAAATGACCAGGCCAAGAC-3\ 5 '-CAACCCTG GAC 
C TCACACCTA-3'), SCMH1 (Hs.57475, 5'-GCCTTGACC 
ACATCACTCCAT-3', 5 ' -AG GCCTAGGGCTGCAAAAG -3 ') , 
and clusterin (Hs.75106, 5 '-GCAGGAATACCGCAAAAA 
GC-3\ 5'-GACTCAAGATGCCCCCGTAAG-3 T ). Standard 
samples (50, 25, 1 0, 5, 2.5, 1 and 0.5 pL cDNA) and experi- 
mental samples were used in real-time quantification PCR. 
Each sample was run in quadruple reactions. Standard 
curve (Ct = mX + B) was obtained, where Ct is threshold 
cycle number, X is log quantity of target molecules, m is 
curve slope and B is Y-axis intercept value. Number of 
fold induction or repression for a given target molecule 
was calculated by Qa/Qb, where Qa is the target quantity 
in experimental sample A and Qb is the target quantity 
in experimental sample B. Cta = mlogQa + B or Qa = 
10 tcta -B)/m therefore, Qa/Qb = 1 nfcta-ctbym 



2.5 2-D PAGE 

The procedure followed was as previously described 
[14]. Cells were solubilized in 200 \iL of lysis buffer con- 
taining 9.5 m urea (Bio-Rad); 2% NP-40, 2% carrier 
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ampholytes pH 4-8 (Gallard/Schlessinger, Carle Place, 
NY, USA), 2% p-mercaptoethanol and 10 mui PMSE 
Aliquots containing approximately 5x10 6 cells, were 
applied onto isofocusing gels. IEF was conducted using 
pH 4 to 8 carrier ampholytes at 700 V for 16 h, followed 
by 1000 V for an additional 2 h. The 1-D gel was loaded 
onto the 2-D gel, after equilibration in 1 25 rrwiTris, pH 6.8, 
10% glycerol, 2% SDS, 1% DTTand bromophenol blue. 
For the second dimension separation, a gradient of 
11-14% of acrylamide (Serva, Hauppauge, NY, USA) 
was used. Following 2-D PAGE, proteins were visualized 
by silver staining of the gels or transferred to an Immobi- 
lon-P PVDF membrane (Millipore, Bedford, MA). 



2.6 Protein identification by mass spectrometry 

The 2-D gels were silver stained by successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1% silver 
nitrate for 40 min and 0.014% formaldehyde plus 2% 
sodium carbonate. The proteins of interest were excised 
'from the 2-D gels and destained for 5 min in 15 mM 
potassium ferricyanide and 50 mwi sodium thiosulfate 
as described [15]. Following three washes with water, 
the gel pieces were dehydrated in 100% acetonitrile for 
5 min and dried for 30 min in a vacuum centrifuge. 
Digestion was performed by addition of 100 ng of trypsin 
(Promega, Madison, Wl, USA) in 200 itim ammonium 
bicarbonate or by addition of 100 ng of the endopro- 
teinase Glu-C (Promega) in 100 mM ammonium bicarbo- 
nate. The Lys-C digestion was performed with 500 ng of 
the endoproteinase Lys-C (Roche, Mannheim, Germany) 
in 100 mM Tris-HCI, pH 9. Following enzymatic digestion 
overnight at 37 °C, the peptides were extracted twice 
with 50 jaL of 60% acetonitrile/1 % TFA. After removal of 
acetonitrile by centrifugation in a vacuum centrifuge, the 
peptides were concentrated by using pipette tips C18 
(Millipore). 

Analyses were performed primarily using a PerSeptive 
Biosystem MALDI-TOF Voyager-DE mass spectrometer 
(Framingham, MA, USA), operated in delayed extraction 
mode. Peptide mixtures were analyzed using a saturated 
solution of cyano-4-hydroxycinnamic acid (CHA) (Sigma, 
St. Louis, MO, USA) in acetone containing 1% TPA 
(Sigma). Peptides were selected in the mass range of 
800-4000 Da. Spectra were calibrated using calibration 
mixture 2 of the Sequazyme peptide mass standards kit 
(PerSeptive Biosystems). The search program MS-Fit, 
developed by the University of California at San Francisco 
(http://prospector.ucsf.edu), was used for searches in the 
NCBI database. Search parameters were as follows: 
maximum allowed peptide mass error of 400 ppm, con- 
sideration of one incomplete cleavage per peptide and 



pH range between 4 and 8. MALDI-TOF mass spectro- 
metry was also used for molecular weight determina- 
tion as described [16], In some cases, the amino acid 
sequence of some peptides of interest was determined 
by ESI MS analysis; 

3 Results 

3.1 SAGE analysis of the effect of androgen 
on gene expression 

SAGE libraries were generated from LNCaP cells cul- 
tured in the presence or absence of DHT. A total of 
123371 tags were generated of which 62878 were from 
the LNCaP cell line and 60493 from the LNCaP ceils 
treated with DHT for 24 h (Table 1). Sequence analysis 
identified a total of 28844 distinct tags representing 
16570 genes, 11 243 from LNCaP and 12203 from DHT 
treated cells. A total of 351 transcripts were differentially 
expressed at a significant level (p<0.05). Eighty-seven 
percent of transcripts matched GenBank entries; 79% 
corresponded to known sequences and 8% to ESTs. 
RNA levels for 147 genes were increased and 204 genes 
were decreased after DHT treatment (Table 2). Therefore, 
at the RNA level more genes were repressed than 
activated by androgens. Of these androgen-regulated 
genes, 149 were changed > five-fold by DHT treatment 
(Table 3). 



Table 1. Summary of SAGE analysis in LNCaP cells 



Sample 


Total 


Distinct 


Number 




transcripts 


tags 


of genes 


-DHT 


62 878 


17 050 


11 243 


+ DHT 


60 493 


18 510 


12 203 


Total 


123 371 


28 844 


16 570 


- DHT: LNCaP cells without dihydrotestosterone; + DHT: 


LNCaP cells treated with 10 -9 m dihydrotestosterone for 


24 h 








Table 2. DHT regulated genes in LNCaP cells 




Genes 




LNCaP v$ LNCaP + DHT 






(gene #) 








p 0.05 


p 0.01 


Known genes 




277 


147 


ESTs 




29 


8 


No match 




45 


13 


Total 




351 


168 


Up-regulated by DHT (24 h) 


147 


65 


Down-regulated by DHT (24 h) 


204 


103 
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Table 3. List of candidate genes that are regulated by DHT (> 5 fold) 
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Q arc 


irnmunogioDuiin j poiypepiiae, imKer proxein ior immuno- 


76325 










globulin alpha and mu potypetides 


1 out 


1 


17 


II IUULCU 






7ft 


n 


1 ^ 
1 0 


ln*Hi if^ari 
II lUULcu 




v/-iitn nv/inn Qarrnmfl vini<i "t 7 nnrnnpnfl homnlnn 
v Jul 1 dvidi 1 odiLAJi i la vii uo it \ji lo^jyci id i iwt i iwiuy 


f O OH*+ 


i 


1fi 

ID 


II ILiUUcLI 


1 •cUuTJ't 


iiiyuoiiif iicavy y)\j\y fjcpuuc? i i r omiuuui iiiuooic 


OO UUD 


1Q 


O 
c. 


no pi cooou 


1 .50E-04 




COO O 1 C 


1 K 

IO . 


I 




O. I3C"U*t 


nuuicdi uioiiiuiJiiwi i yci it? o y*' * uuuidi loi i iui i ivjiuy 


T A ATA 


n 
u 


I 1 


inaucea 




INUVCl 




1 1 




nepresseo 




INOVcl 




1 0 




Repressed 


ft /nc nA 




ton ^yo 


u 
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10 


0 


Repressed 


1 .30E-03 


ESTs 


1 o4 1 \>£. 


10 


0 


Repressed 


1 .40C-UO 


MUr -riLJUoylaliUM IdOlUI -lllvc C 


6895 


1 


12 


Induced 


1.49E-03 


actin related protein 2/3 complex, subunit 3 (21 kD) 


143240 


17 


3 


Repressed 


1 .63E-03 


ESTs 


211582 


0 


9 


Induced 


1 .66E-03 


myosin, light polypeptide kinase 


180266 


0 


9 


Induced 


1 .66E-03 


tropomyosin 2 (beta) 


93002 


14 


2 


Repressed 


2.53E-03 


ubiquitin carrier protein E2-C 


17883 


14 


2 


Repressed 


2.53E-03 


protein phosphatase 1G (formerly 2C), magnesium- 
dependent, gamma isoform 


126023 


14 


2 


Repressed 
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ho \ S, Highly Similar to N 1 Ui_nUMAN fNbUnUljblNlU LUUUo 
vi\j \ un i x\\J f oiN rn_/ivi uluu i rnuvu noun 
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98260 


8 


0 


Repressed 


4.34E-03 


ESTs 


CATAAGACTT 


8 


0 


Repressed 


4.64E-03 


Novel 


TACGGGGACA 


2 


12 


Induced 


5.76E-03 


Novel 


180034 


12 


2 


Repressed 


6.91 E-03 


cleavage stimulation factor, 3* pre-RNA, subunit 3, 77kD 


5753 


12 


2 


Repressed 


6.91 E-03 


inosito(myo)-1 (4)-monophosphatase 2 


278941 


10 


1 


Repressed 


6.94E-03 


PRO0628 protein 


19500 


0 


7 


induced 


7.25E-03 


nuclear localization signal deleted in velocardiofacal syndrome 
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Table 3. Continued 



UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


9006 


0 


7 


Induced 


7.25E-03 


VAMP (vesicle-associated membrane proteinj-associated 
protein A (33kD) 


26471 


0 


7 


t Induced 


7.25E-03 


Homo sapiens clone HQ0692 


AACTGCTGGC 


12 


2 


Repressed 


7.63E-03 


Novel 


GCTGACCGTC 




0- 


Repressed 


8.59E-03 


Novel 


CCCCCCTGTC 


7 


0 


Repressed 


8.59E-03 


Novel 


243901 


2 


11 


Induced 


8.82E-03 


Homo sapiens cDNA FU20738 fis, clone HEP08257 


119000 


1 


9 


Induced 


9.12E-03 


actinin, alpha 1 


78596 


1 


9 


Induced 


9.12E-03 


proteasome (prosome, macropain) subunit, beta type, 5 


171955 


7 


0 


Repressed 


9.61 E-03 


trophinin associated protein (tastin) 


180545 


7 


0 


Repressed 


9.61 E-03 


Homo sapiens mRNAfor hypothetical protein (TR2/D1 5 gene) 


77719 


7 


0 


Repressed 


9.61 E-03 


gamma-glutamyl carboxylase 


4877 


7 


0 


Repressed 


9.61 E-03 


CGi-51 protein 


22795 


7 


0 


Repressed 


9.61 E-03 


ESTs 


79335 


7 


0 


Repressed 


9.61 E-03 


Homo sapiens SWI/SNF-related, matrix-associated, actin- 
. dependent regulator of chromatin D1 (SM ARC D1) mRNA 


119177 


9 


1 


Repressed 


1.26E-02 


ADP-ribosylatin factor 3 


CGGGAGCACC 


9 


1 


Repressed. 


1.29E-02 


Novel 


182217 


0 


6 


Induced 


1.32E-02 


succinate-CoA ligase, ADP-forming, beta subunit 


75106 


0 


6 


Induced 


1.32E-02 


clusterin (complement lysis inhibitor, SP-40,40, sulfated gly- 
coprotein 2, testosterone-repressed prostate message 2) 


256311 


0 


6 


Induced 


1.32E-02 


granin-like neuroendocrine peptide precursor 


103180 


0 


6 


Induced 


132E-02 


DC2 protein 


TACGGGGATG 


1 


8 


Induced 


1.70E-02 


Novel 


AAACAAATCA 


2 


10 


Induced 


1.70E-02 


Novel 


136644 


2 


10 


Induced 


1.71E-02 


CS box-containing WD protein 


8036 


2 


10 


Induced 


1.71E-02 


glioblastoma overexpressed 


74 284 


2 


10 


Induced 


1.71E-02 


ESTs, Weakly similar to Similar to S. cerevisiae hypothetical 
protein L3111 


ATGGCTGATC 


6 


0 


Repressed 


1.75E-02 


Novel 


ATCACTGCCC 


6 


0 


Repressed 


1.75E-02 


Novel 


ACATCATCAG 


6 


0 


Repressed 


1.75E-02 


Novel 


CCAGTCCAAG 


6 


0 


Repressed 


1.75E-02 


Novel 


54842 




8 


Induced 


1.78E-02 


ESTs 


1526 


1 


8 


Induced 


1.78E-02 


ATPase, Ca++ transporting, cardiac muscle, slow twitch 2 


7911 


1 


8 


Induced 


1.78E-02 


KIAA0323 protein 


227400 


1 


8 


Induced 


1.78E-02 


mitogen-activated protein kinase kinase kinase kinase 3 


77269 


1 


8 


Induced 


1.78E-02 


guanine nucleotide binding protein (G protein), alpha inhibiting 
activity polypeptide 2 


7943 


6 


0 


Repressed 


1.81E-02 


RPB5 -mediating protein 


278544 


6 


0 


Repressed 


1.81E-02 


acetyl -Coenzyme A acetyltransf erase 2 (acetoacetyl 
Coenzyme A thiolase) 


23111 


6 


0 


Repressed 


1.81E-02 


phenylatanine-tRNA synthetase -like 


31442 


6 


0 


Repressed 


1.81E-02 


RecQ protein-like 4 


38628 


6 


0 


Repressed 


1.81E-02 


hypothetical protein 


77422 


6 


0 


Repressed 


1.81E-02 


proteolipid protein 2 (colonic epithelium-enriched) 


211973 


6 


0 


Repressed 


1.81E-02 


homolog of Yeast RRP4 (ribosomal RNA processing 4), 
3'-5'-exoribonuclease 


171 075 


6 


0 


Repressed 


1.81E-02 


replication factor C (activator 1) 5 (36.5kD) 


89 781 


6 


0 ' 


Repressed 


1.81E-02 


upstream binding transcription factor, RNA polymerase I 


154149 


6 


0 


Repressed 


1.81E-02 


apurinic/apyrimtdinic endonuclease(APEX nuclease)-like 2 



protein 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


279772 


8 


1 


Repressed 


2.28E-02 


brain specific protein 


205091 


8 


1 


Repressed 


2.28E-02 


ESTs, Weakly similar to WW domain binding protein 1 1 
(M. musculus) 


75658 


8 


1 


Repressed 


2.28E-02 


phosphorylase, glycogen; brain 


GGGCAGCTGT 


8 


1 . 


Repressed 


2.37E-02 


Novel 


105440 


0 


5 


Induced 


2.43E-02 


hepatocyte nuclear factor 3, alpha 


118244 


0 


5 


Induced 


2.43E-02 


protein phosphatase 2, regulatory subunit B (B56) f delta iso- 
form . 


82389 


0 


5 


Induced 


2.43E-02 


CGI-1 18 protein 


31638 


0 . 


5 


Induced 


2.43E-02 


restin (Reed-Steinberg cell-expressed intermediate filament- 
associated protein) 


12013 


0 


5 


Induced 


2.43E-02 


ATP-binding cassette, sub-family E (OABP), member 1 


12797 


0 


5 


Induced 


2.43E-02 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 


7736' 


0 


5 


Induced 


2.43E-02 


hypothetical protein 


274479 


0 


5 


Induced 


2.43E-02 


IMME7 


153138 


0 


5 


Induced 


2.43E-02 


origin recognition complex, subunit 5 (yeast homoiog)-like 


16034 


0 


5 


Induced 


2.43E-02 


ESTs 


270072 


0 


5 


Induced 


2.43E-02 


ESTs 


187035 


0 


5 


Induced 


2.43E-02 


ESTs 


140452 


0 


5 


Induced 


2.43E-02 


cargo selection protein (mannose 6 phosphate receptor bin- 
ding protein) 


182265 


0 


5 


Induced 


2.43E-02 


keratin 1 9 


19762 


0 


5 


Induced 


2.43E-02 


ESTs, Weakly similar to unknown {D. melanogastet) 


317 


0 


5 


Induced 


2.43E-02 


topoisomerase (DNA) 


6236 


0 


5 


Induced 


2.43E-02 


ESTs 


GCTGGAGCCT 


5 


0 


Repressed 


3.03E-02 


Novel 


CCAGTGCTCA 


5 


0 


Repressed 


3.03E-02 


Novel 


ACCCTACATA 


5 


0 


Repressed 


3.03E-02 


Novel 


GGGGAAATCT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACTGGTACTG 


5 


0 


Repressed 


3.03E-02 


Novel 


GCTCCGGTGT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACAGTGGTGA 


5 


0 


Repressed 


3.03E-02 


Novel 


7869 




7 


Induced 


3.04E-02 


lysophosphatidic acid acyltransf erase -delta 


12101 


1 


7 


Induced 


3.04E-02 


hypothetical protein 


266940 




7 


Induced 


3.04E-02 


t-complex-associated-testis-expressed 1 -like 1 


6196 




7 


Induced 


3.04E-02 


integrin-linked kinase 


366 




7 


Induced 


3.04E-02 


6-pyruvoyltetrahydropterin synthase 


173611 




7 


Induced 


3.04E-02 


NADH dehydrogenase (ubiquinone) Fe-S protein 2 (49k D) 
(NADH-ceonzyme Q reductase) 


102469 




7 


Induced 


3.04E-02 


putative nuclear protein 


78825 


1 


7 


Induced 


3.04E-02 


matrin 3 


284465 




7 


Induced 


3.04E-02 


ESTs 


30738 




7 


Induced 


3.04E-02 


hypothetical protein FLJ1 0407 


78687 


t 


7 


Induced 


3.04E-02 


neutral sphingomyelinase (N-SMase) activation assoviated 
factor 


92381 


1 


7 


Induced 


3.04E-02 


nudix (nucleoside diphosphate linked moiety X)-type motif 4 


242039 


5 


0 


Repressed 


3.26E-02 


EST 


4766 


5 


0 


Repressed 


3.26E-02 


DKF2P586O01 20 protein 


283109 


5 


0 


Repressed 


3.26E-02 


hypothetical protein DKF2p762L171 0 


192853 


5 


0 


Repressed 


3.26E-02 


ubiquitin-conjugating entyme E2G 2 (homologous to yeast 
UBC7) 


153678 


5 


0 


Repressed 


3.26E-02 


reproduction 8 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


251317 


5 


0 


Repressed 


3.26E-02 


EST 


279623 


5 


0 


Repressed 


3.26E-02 


selenoprotein X 


150319 


5 


o 


ncjji coocu 




FST<? 

L_v_> | O 


1 02456 


5 


n 


no \Jt CoocU 


o 9Rp_np 


cturvivai ot rnoior protein inieractiny proxein 1 


284250 


e; 
\j 


n 




O.tOC 


mu uu j pruiein 


251871 


5 


n 


nt; jj r co bcu 


9KP»n9 


o i r syninase 


270480 


7 


1 




3.92E-02 


E55T<5 Wpaklv similar tn Al 1 1^ HI I MAN Al 1 1 IRFAMN Y 

I — w i Op vvc7cu\iy on final wj nLUJ it\J IVI/AI >( /AL-W oUDrrMViiU i 

SEQUENCE 


26655 


7 


1 


Repressed 


3.92E-02 


glucose-6-phosphatase, transport (glucose-6-phosphate) 
protein 1 


8118 


7 


1 


Repressed 


3.92E-02 


leukotriene A4 hydrolase 


CTCCGCCGGC 


7 


1 


Repressed 


4.47E-02 


Novel 


AGGAAATGCT 


7 


1 


Repressed 


4.47E-02 


Novel 


GCTGACCGAGG 7 


1 


Repressed 


4.47E-02 


Novel 


CCCATAGTCC 


7 


1 


Repressed 


4.47E-02 


Novel 



CAP: LNCaP cells without dihydrotestosterone 

DHT: LNCaP cells treated with 10^ dihydrotestosterone for 24 h 

UGD: unigene ID 

Numbers in CAP and DHT columns are the number of tags observed in respective samples 



3.2 Confirmation of SAGE data 

To confirm the differential expression pattern at the RNA 
level, semiquantitative RT-PCR assay was performed on a 
group of selected genes. One \iq of total RNA was reverse 
transcribed into cDNA with oligo dT. cDNA was amplified by 
PGR for various cycles. PCR products were separated on 
agarose gels and quantified by densitometry. PCR ampli- 
fication curves were plotted for each gene, and data within 
the logarithmic phase of amplification were used for quanti- 
fication (Fig. 1 A, B and C). The reproducibility of gene regu- 
lation by androgen was confirmed in three independent 
experiments by monitoring the induction of PSA in the pres- 
ence or absence of cyclohexamide (Fig. 1 B). The kinetics 
of gene induction or repression was determined for PSA 
(Hs.171995), ciusterin (Hs,75106) and SCMH1 (Hs.57475) 
genes. Typical examples of kinetics are shown in Figure 
1C. PSA was induced at 4-6 h, peaked between 6-20 h, 
and gradually declined after 20 h post-treatment of DHT. 
Ciusterin was induced within 0.5-1 h and gradually declined 
after 6-12 h. SCMH1 was repressed 2-4 h post-treatment. 
Expression of PSA, ciusterin , and SCMH1 was quantified by 
real-time PCR (Fig. ID). 

3.3 Changes in protein expression induced 
by DHT 

Protein lysates of LNCaP cells cultured in the presence of 
DHT for 72 h and cells cultured in parallel in the absence 
of DHT were subjected to 2-D PAGE (Fig. 2). Following sil- 



ver staining, gels were digitized prior to computer-based 
matching and quantitative analysis, as previously des- 
cribed [17]. Of a total of 1031 protein spots matched 
between 2-D patterns of DHT treated and untreated cells, 
a set of 32 protein spots increased in intensity by at least 
two-fold in two independent experiments. Likewise, a set 
of 1 2 protein spots decreased in intensity by at least two- 
fold in these two independent experiments. A two-fold 
change in integrated intensity by silver staining represents 
on average a three -fold change in amount of protein, 
based on prior quantitative studies [1 7]. 

The 44 protein spots that changed in intensity as a result 
of DHT treatment were excised from the gels, digested 
with trypsin, and subsequently analyzed by MALDI-TOF 
MS. The resulting spectra were used to identify the pro- 
teins using the MS-Fit search program. Of the 44 spots 
excised from the gels, 29 were identified without ambi- 
guity and consisted of 21. up-regulated and eight down- 
regulated proteins (Table 4). The identified proteins repres- 
ented a heterogeneous group that included cytoskeletal 
proteins (e.g. tropomyosin, a tubulin), metabolic enzymes 
(e.g. adenine phosphoribosyl transferase, f> 1 ,4 galactosyl 
transferase, galactocerebrosidase), and the products of 
previously described androgen responsive genes (e.g. 
SRY, selenium binding protein) [18, 19], Specific anti- 
bodies confirmed the identification based on MS for all 
proteins analyzed by Western blotting, which included a 
tubulin, myosin light chain isoforms, nucleoside diphos- 
phate kinase A, glyceraldehyde 3-phosphate dehydro- 
genase (G3PD), and tropomyosin (data not shown). . 



1334 A.Waghrayefa/. Proteomics 2001, 1, 1327-1338 






Figure 1. Kinetics and quanti- 
fication of androgen-regulated 
genes. Total RNA from un- 
treated and DHT treated 
LNCaP cells was reverse 
transcribed. cDNA was PCR 
amplified for various cycles. 
PCR. amplification curves were 
plotted for each gene. A cycle 
number within the logarithmic 
phase of amplification was 
selected for semi-quantitative 

^*S*. „mm«~ .SS^^S- analysis. cDNA was also used 

. for real-time quantitative PCR. 

\:..?*^™£9^ *** ^ (A) An example of RT-PCR 

amplification of PSA at cycles 
between 20-30. (B) RT-PCR 
mmt -H a*:**.*'4, of PSA in tnree independently 
'■ /> -- y treated LNCaP samples, h: 
hours; 0: no DHT; 24: treated 
with 10- 9 M DHT for 24 h; -: 
without cycloheximide; +: with 5 ng/mL of cycloheximide for 24 h. (C) An example of three independent kinetics experi- 
ments for PSA, clusterin and SCMH1 . (D) Comparison of SAGE results with that of real-time quantitative PCR for PSA, 
clusterin and SCMH1 . f: induction; J,: repression. 



I 




Corresponding SAGE data were available for most of the 
proteins affected by DHT treatment that were identified. It 
was therefore of interest to determine if the changes 
observed at the protein levels were matched by concor- 
dant changes at the mRNA level. Remarkably, for most of 
the proteins identified, there was no appreciable concor- 
dant change at the RNA level (Table 4). 



4 Discussion 

We have compared the gene expression profile of 
LNCaP, an androgen responsive prostate cancer cell 
line, with or without androgen (DHT) treatment. Approxi- 
mately 350 of 16570 expressed genes detected at the 
RNA level were affected by dihydrotestosterone at the 
p<0.05 level. The DHT responsive genes included 
known genes, ESTs, and novel genes. As expected, we 
saw an induction of genes that are well known to be 
regulated by androgens. For instance, we found a 1.7-j 
fold induction in kallikrein 3/PSA, a 7.6-fold induction in! 
prostatic kallikrein 2, and a 15.7-fold induction in pro- ; 
static acid phosphatase by DHT in LNCaP cells. We 
also saw a five-fold induction in NKX3.1/NKX3A and 
three-fold in fatty acid synthase; two previously identified 
androgen-regulated genes [20, 21]. Interestingly p-micro- 
seminoprotein, reported to be reduced or lost in prostate 
tumor [22], was up-regulated more than 40-fold in 



LNCaP cells by DHT. More significantly, our data indicate 
that genes involved in a variety of tumor cell functions 
such as growth, apoptosis, and metastasis, are directly 
or indirectly regulated by androgens. In addition, it is 
noteworthy that both p-actin and G3PD, the two most 
frequently used loading controls, were up-regulated 
approximately two-fold by DHT (p<0.01). Thus, these 
two genes may not be appropriate controls in some 
experiments. 

Serial analysis of androgen-regulated gene expression 
provides us with a list of candidate genes. However, 
many factors such as the dose of DHT and the time of 
cell exposure to DHT and other unknown experimental 
variations will affect the level of gene expression. There- 
fore, it is important to confirm the SAGE results with alter- 
native methods. We performed semiquantitative RT-PCR 
(Fig. 1 B) on approximately 20 genes and real-time quanti- 
tative PCR on 10 genes in at least three independently 
DHT treated LNCaP samples. We noticed experimental 
variations in every gene determined. Some genes such 
as PSA (Fig. 1B) have smaller variations and some have 
V larger variations. In addition, the fold of alteration in 
expression identified by SAGE is different for some genes 
compared to that identified by real-time quantitative PCR. 
This is probably due to the technical limitation of SAGE. 
Serial analysis of gene expression is highly quantitative 
for genes whose tags are detected at large nilmbers. 
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Rgure 2. 2-D PAGE patterns of 
untreated (A) and LNCaP cells 
treated with DHT for 72 h (B). 
Identified proteins that were 
reduced in levels following DHT 
treatments are shown in (A) and 
induced proteins are shown in 
(B). Numbered spots represent 
either proteins that were chan- 
ged in level that have not been 
identified or isoforms of identi- 
fied proteins. 
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Table 4. List of proteins that are altered 


by DHT (> 2 


>-fold) 














Unigehe 


Protein 




Change 


mRNA 




Change 


^nort number 


ID 


Oh 


72 h 


in protein 


Oh 


24 h 


in mRNA 


Adenine phosphoribosyl transferase 


28914 


0.448 


1.084 




36 


18 


I 


Alpha Endosulfine 


111680 


0.066 


0.679 


T 


2 


5 




Alrkha Ti iKIiitn 

MlfJIld 1 UUIUII I 
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Numbers in protein are the intensity of protein spots 

Numbers in mRNA columns are the number of SAGE tags detected 

ND: not detected 
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When the tag number detected for a given gene is close 
to zero, the quantitative nature of SAGE is compromised. 
For instance, in approximately 60 000 transcripts of each 
sample, the PSA tag was detected 42 times in LNCaP and 
70 times in LNCaP treated with DHT. However, the tag for 
clusterin was detected 0 times and 6 times, and the tag 
for SCM H1 5 times and 0 times in the corresponding sam- 
ples. Therefore, therSAGE data of PSA is more reliable 
than that of clusterin and SCMH1 . Indeed, SAGE data of 
PSA is virtually identical to that of real-time quantitative 
PCR, whereas the SAGE data of clusterin and SCMH1 dif- 
fer from that of quantitative PCR (Fig. 1 D). 

PSA is probably the best-known androgen-regulated 
gene. We were surprised to see only 1.7-fold induction 
by DHT. However, kinetics experiments indicate that PSA 
is induced at 4-6 h, peaked between 6-20 h, and gradu- 
ally declined 20 h post-treatment of DHT. Since the SAGE 
experiment was done in samples treated for 24 h with' 
DHT, PSA mRNA level was likely to have declined from 
its peak level. Indeed, we could detect 5-10 fold induction 
of PSA at 6-8 h post -treatment. Clusterin was reported as 
an androgen-repressed gene in the rat prostate [23]. 
Recent evidence indicates that clusterin may not be 
directly androgen-repressed, but regulated by apoptotic 
stimuli [24]. Our results suggest that clusterin was 
induced within 0.5-1 h, gradually declined after 6-12 h, 
and after 24 h reduced to a lower level than that of 
untreated cells (Fig. 1C). 

Another important question is how many of the andro- 
gen-regulated genes identified are directly induced or 
repressed by androgens. In order to address this ques- 
tion, detailed analyses must be done for each gene. As 
the first step, we will divide the androgen-regulated 
genes identified in this study into two groups; the cyclo- 
hexamide-sensitive group whose induction or reduction 
of expression by DHT is blocked by the protein synthesis 
inhibitor and the cyclohexamide-insensitive group whose 
induction or reduction is not affected by cyclohexamide. 
We consider that genes in the latter group are directly 
regulated by DHT. Preliminary results suggest that 
approximately 20% of the 20 genes studied by RT-PCR 
are cyclohexamide-insensitive. Experiments are under 
way to determine the cyclohexamide sensitivity of all of 
the 149 genes listed in Table 3. Further characterization 
of androgen-regulated genes may provide some clues 
on the transition from hormone sensitive to hormone 
refractory prostate cancer. 

A relatively small set of genes could be analyzed at the 
protein level, largely due to the limited Sensitivity of 2-D 
PAGE. Nevertheless, a substantial number of detected 
proteins (44 of 1031 proteins analyzed, 4.3%) were 
affected by DHT treatment. Some of the proteins in this 



subset that were identified included the products of 
genes that were previously shown to be affected by 
androgens, namely selenium binding protein, brain speci- 
fic creatine kinase, SRY protein, B23 and G3PD. Using 
a subtractive approach, the human selenium-binding 
protein gene was shown to be differentially expressed 
in LNCaP and reversibly down-regulated by exogenous 
androgen in a concentration-dependent manner, in con- 
cordance with our findings at the protein level [19]. An 
increase in levels of the brain specific creatine kinase B 
chain in response to androgen has also been described 
[25]. Likewise the SRY gene has been found to be respon- 
sive to androgen stimulation in LNCaP cells [18]. Also, a 
number of glycolytic enzymes including G3PD which 
were affected at the protein level have been found to be 
responsive to androgen stimulation [26]. Androgenic reg- 
ulation of the amount and phosphorylation of the protein 
B23, included in our list of affected proteins, has been 
previously described and found to be related to the early 
changes associated with androgen mediated growth of 
the prostate [27]. 

Corresponding SAGE data was available for most of the 
proteins affected by DHT treatment that were identified. 
Interestingly, for most of the proteins identified, there 
was no appreciable concordant change at the RNA level. 
There are several potential explanations for this lack of 
concordance. A particular gene may be represented by 
more than one protein isoform in 2-D gels. For example, 
in the identified group, three proteins (myosin light chain, 
G3PD and selenium binding protein) were represented 
by more than one isoform. Thus, a source of discordance 
between RNA and protein data may be that the protein 
change is limited to a particular isoform of a protein and 
not to overall protein products of a particular gene. 
Nevertheless, a change in a particular isoform is inform- 
ative and biologically meaningful and may not be pre- 
dictable from RNA data. A lack of concordance between 
RNA and protein data may also reflect either translational 
control, post -translational modifications, or changes in 
protein turnover due to DHT treatment. Yet another 
explanation for a lack of concordance could be a tag 
time for changes at the RNA level to be reflected in a 
protein change. It follows from the above considerations 
that monitoring gene expression at both RNA and protein 
levels may provide complementary information that could 
not be ascertained by solely measuring RNA or protein. 
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Based on biomedical literature databases, we tried a first step for constructing a gene expression 
"data warehouse" specific to human colorectal cancer (CRC). Results of genome-wide tran- 
scriptomic research were available from 12 studies, using various technologies, namely, SAGE, 
cDNA and oligonucleotide arrays, and adaptor-tagged amplification. Three studies analyzed CRC 
ceil lines and nine studies of human samples. The total number of patients was 144. Out of 982 
up- or down-regulated genes, 863 (88%) were found to be differentially expressed in a single 
study, 88 in two studies, 22 in three studies, 7 in four studies, and only 2 genes in six studies. 
Eight large-scale proteomics studies were published in CRC, using 2-D-, SDS- or free-flow elec- 
trophoresis, involving only 11 patients. Out of 408 differentially expressed proteins, 339 (83%) 
were found to be differentially expressed only in a single study, 16 in three studies, 10 in four 
studies, 3 in five, and 1 in eight studies. Confirmation at proteome level of results obtained with 
large-scale trans criptomics studies was possible in 25%. This proportion was higher (67%) for 
reproducing proteome results using transcriptomics technologies. Obviously, reproducibility and 
overlapping between published gene expression results at proteome and transcriptome level are 
low in human CRC. Thus, the development of standardized processes for collecting samples, 
storing, retrieving, and querying gene expression data obtained with different technologies is of 
central importance in translational research. 
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1 Introduction 

Translational research in human colorectal cancer (CRC) is 
applying a large spectrum of molecular biology, cellular bi- 
ology, and advanced validation tools. In particular, genome- 
wide techniques are now applied to decipher modifications 
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in gene expression. In recent years, transcriptomics and 
proteomics tools have been broadly applied in CRC. The 
hope is that the new data obtained will now allow a classifi- 
cation of disease on molecular basis, deep insights into the 
pathophysiology of CRC, prognostic statements, and finally a 
systematic search for diagnostic and therapeutic targets. 

Because of the complexity of the biological system under 
investigation, the most significant contribution of transla- 
tional research in CRC is expected to derive not from the 
analysis of single experiments but from libraries of experi- 
ments. In other words, the results obtained so far by trans- 
lational research tools in different clinical and experimental 
settings need to be compared, contrasted, and if possible 
synthesized. Thus, based on the biomedical literature data- 
bases, we tried a first step for constructing such "data ware- 
house" specific to human CRC. 

* www.proteomics-iournal.de 
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The specific aims of the present study were to screen 
transcriptomics and proteomics studies published in CRC, 
and to elaborate a kind of "meta-analysis of gene expression" 
in CRC. Our hypothesis was that we would be able to deter- 
mine a common set of genes and gene products that are up- 
or down-regulated in human CRC by overlapping results 
obtained by different authors. 



2 Materials and methods 

This is a literature survey of gene expression data published 
in human CRC. A Medline search of reports published in 
English from 1990 to 2004 using the terms "colon cancer" 
and "gene expression" with the limit "human" was performed 
and yielded 1979 articles. All abstracts were reviewed and a 
related article search was performed on appropriate abstracts. 
Articles were selected by a consensus of two reviewers (E.S., 
M.R.) that satisfied these predetermined criteria: sample ori- 
gin (human and/or cell lines) and preparation detailed, tech- 
nology for gene expression studies defined (transcriptomics 
and/or proteomics studies), and quantitative results on over- 
expression or underexpression available. Studies concerning 
single genes or arbitrarily selected genes were discarded. 
Results obtained in animal models were not considered. 

Data were entered into an Excel working sheet (Micro- 
soft, Seattle, USA). Since gene expression data were not 
always obtained with quantitative, but in most cases semi- 
quantitative gene expression analysis technologies, no 
expression values or ratios were entered into the database. 
Only genes reported as being over- or underexpressed were 
entered. No threshold was defined so that some genes 
defined as differentially expressed might have shown only 



marginal differences. Only descriptive statistics are provided, 
which were obtained with the in-built tools of the Excel soft- 
ware. 

Various methods were used for large-scale translational 
research in CRC, including transcriptomics and proteomics 
technologies. Results of genome-wide transcriptomic re- 
search are available from 12 studies, using various technolo- 
gies, namely, SAGE (five studies), complementary DNA 
(cDNA) arrays (five studies), and oligonucleotide array (one 
study), as well as adaptor-tagged amplification (one study). 
Three studies analyzed CRC cell lines and nine other studies 
analyzed human samples. The total number of patients pro- 
filed in all studies was 144. Out of the nine studies with hu- 
man samples, only two studies with a total of only 22 patients 
were performed using purified epithelial cells (one of them 
using cDNA arrays, the other one using SAGE technology). 
An overview of these studies is provided in Table 1. 

Eight large-scale proteomics studies were published in 
CRC using various technologies, namely, 2-D PAGE (six 
studies), SDS-PAGE (one study), and free-flow electrophore- 
sis (one study). In total, all proteome studies in CRC involved 
only 11 patients. An overview of these studies is provided in 
Table 2. 



3 Results 

One thousand two-hundred and forty genes have been 
reported to be dysregulated (up- and/or down-regulated) in 
human CRC, representing about 5% of the 20 000-25 000 
human genes. 



* The complete dataset can be consulted: 
http: //www.chirurgiebethel.de 



Table 1. Large-scale or genome-wide transcriptomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only 144 patients 

— ~ — " ~~ Preparation Method Number 

Author Year bampie of patients 



Zhang etal. (181 . 


1997 


SAGE-NET [191 


1997 


Zhang ef at. [18] 


1997 


Parle-McDermott etal. [201 


2000 


Yanagawa etal. [211 


2001 


Takemasa etal. [22] 


2001 


Nottermann ef al [23] 


2001 


Buckhaults etal. [241 


2001 


Birkenkamp etal. [251 


2002 


Lin etal. [261 


2002 


Muro et at. [271 


2003 


Williams etal. [281 


2003 



Normal mucosa and primary tumors 
Primary tumors and cell lines 
Cell lines 
Cell lines 

Paired normal and cancer 

Paired normal and primary tumors 

Paired normal mucosa and primary 

tumors, or adenoma 
Normal mucosa, adenoma and 

primary tumors 
Paired normal and primary tumors 
Paired normal mucosa and primary 

tumors, or adenoma 
Normal and cancer 
Paired normal mucosa and primary 

tumors, normal and adenoma 



Whole tissue 


SAGE 


2 


Cell lines 


SAGE 




Cell lines 


SAGE 




Cell lines 


SAGE 




Whole tissue 


cDNA 


20 


Whole tissue 


cDNA 


20 


Whole tissue 


Oligonuc- 


22 




leotide array 




Purified epithelial 


SAGE 


6 


cells 






Whole tissue 


cDNA 


27 


Purified epithelial 


cDNA 


16 


cells 






Whole tissue 


Adaptor-tagged 


11 


Whole tissue 


cDNA 


20 
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Table 2. Large-scale or proteomics studies tri human CRC, using cell lines or human tissues. All studies together 
involved only nine patients 



Author 


Publication 


Sample 


Method 


Number 




year 






of patients 


Reymond etal. [9] 


,4997 


Purified epithelial cells 


2-D PAGE 


1 


Simpson etal. [29] 


20OO** 


Cell lines 


SDS-PAGE 




Lawrie etal. [30] 


2001 


Purified epithelial cells 


2-D PAGE 


4 


Simpson etal. [31] 


2001 


Cell lines 


Free-flow electrophoresis 




Medjahed efaU32] 


2003 


In silico 


2-D PAGE 




Demalte etal. [33] 


2003 


Cell lines 


2-D PAGE 




Stierum etal. [34] 


2003 


Cell lines 


2-D PAGE 




Friedman etal. [35] 


2004 


Human whole tissue 


2-D PAGE 


6 



The vast majority of dysregulated genes in human CRC 
was found using transcriptomics tools: a total of 982 genes 
was found to be differentially expressed in at least one of 
these 12 transcriptomics studies. Out of these 982 genes, 863 
(88%) were found to be differentially expressed only in a 
single study in other words these results have not been 
reproduced so far. The other findings could be reproduced in 
two or more transcriptomics studies: 88 genes were found to 
be differentially expressed in two studies, 22 in three studies, 
7 in four studies, no gene in five studies, and 2 genes in six 
studies. The most cited genes are listed in Tables 3 (up-reg- 
ulation) and 4 (down-regulation). 

A total of 408 proteins were found to be differentially 
expressed in human CRC in at least one study. Out of these 408 
molecules. 339 (83%) were mentioned in a single study, in 
other words these results could not be reproduced. Differential 
expression of the remaining 70 proteins were mentioned with 
following frequenc: 40 were mentioned in two proteomics 
studies , 1 6 in three studies, 1 0 in four studies, 3 in five studies , 
and a single protein in eight studies. The most cited proteins 
are listed in Table 5. It has to be noted that only a single study 
[1] provided differential display protein expression data 
obtained in the human patient, using whole tissue biopsy. 

It is also difficult to reproduce transcriptomics results 
with proteomics tools. Out of 982 genes found to be differ- 
entially expressed in human CRC by genome-wide tran- 
scriptomics technologies (Table 6a), only 177 (18%) have 
been confirmed using proteome technologies. When the 
genes reported to be differentially expressed in three and 
more transcriptomics studies (n = 31) are compared with 
those reported to be differentially expressed in three and 
more proteomics studies (n = 30), only two genes (actin: 
ACTB_HUMAN and creatin-kinase: KCRB_HUMAN) can 
be matched. Thus, the probability of reproducing a gene 
expression result obtained at transcriptome level is low when 
using proteomics technologies. 

In contrast, there is a better reproducibility when prote- 
omics results are verified using transcriptomics tools 
(Table 6b). In fact, when the subset of 30 proteins that are 
consistently (in three studies or more) reported to be dysreg- 
ulated in CRC is compared with transcriptomics results, 20 



genes can be matched, representing about two* thirds. Thus, 
the probability of being able to reproduce a proteomics result 
using transcriptomics tools is more than three times higher 
than the opposite way (67 vs. 18%). However, it has to be 
noted that in 16% of cases, results remain unclear or even 
contradictory. ' 

Both in the transcriptomics and proteomics studies, 
many genes and factors were found to be differentially regu- 
lated that obviously do not play a causal role in CRC carci- 
nogenesis. 

4 Discussion 

This study was aimed at screening transcriptomics and pro- 
teomics studies published in CRC, and to elaborate a kind of 
meta-analysis of gene expression in' CRC in order to generate 
a data warehouse that would be useful in translational 
research. A significant number of translational research 
studies have been published in CRC, starting in 1997 with 
SAGE technology, followed by an increasing number of array- 
based transcriptomics studies since 2001, and more recently 
by several proteomics studies. This has created a significant 
amount of data which we were then able to compile. 

Our starting hypothesis was that we would be able to 
determine a common set of genes and gene products that are 
up- or down- regulated in human CRC by comparing results 
obtained, by different authors. This endeavor was not very 
successful; obviously, overlapping between published gene 
expression results both at proteome and transcriptome level 
is low in human CRC. In fact, more than 80% of results 
could not be reproduced. Several explanations can be pro- 
vided to explain this lack of reproducibility. 

First, the number of patients included in the studies is 
low (alltogether 144 patients in transcriptomics, 11 in prote- 
omics studies); some high impact publications having been 
conducted on samples of only two patients [18). This is a 
problem because interindividual genetic variability is high in 
human CRC [2]. Thus, it is allowed to hypothesize that some 
results attributed to gene dysregulation might be caused by 
genetic diversity rather than by cancer-specific traits. 
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Table 3. Results of transcriptornics research in human CRC. Most cited up-regulated genes out of 12 genome-wide or large-scale studies 



TRANS CRIPTOMICS 
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1 






i 










1 








G01U9 




TCPD_HUMAN 


Chaperonin containing TCP-1, subunit4 (delta) 












1 








1 


1 




X75821 


Q04984 


CA1t_HUMAN 


C0L1A1 










1 


1 




1 










U22055 


Q96AG0 


TF3A_HUMAM 


General transcription factor MIA 








i 




1 










1 




U14631 


P80365 


GR0_HUMAN 


GR01 oncogene (melanoma growth stimulating 
activity, alpha) 








i 












1 


1 




M92439 


P42704 


MDP1_HUMAN 


MPDP4, MDP7 microsomal dipeptidase 






i 


1 








1 










AF029082 


P31947 


MPI2_HUMAN 


M-phase inducer phosphatase 2 






i 




















MB6400 


P29312 


PR0C„HUMAN 


Pyrroline 5-carboxilate reductase mRNA 






i 




















L76465 


P15428 


SPRC.HUMAN 


Secreted protein, acidic, cysteine-rich 
(osteonectin) 






i 




















X57352 


Q01G28 


Q96AG0 


100 kDa coactivator mRNA 


























X03205 


P62888 


RL30_HUMAN 


SOS Ribosomal protein L30 


























M95787 


Q01995 


RL23_HUMAN 


60s ribosomal protein L23 
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PI 7075 


CA21_HUMAN 


Collagen alpha-2 type 1 



























© 2005 WILEY- VCH Verlag GmbH & Co. KGaA, Weinheim 



www.proteomics-jburnal.de 



3070 E. Sagynaliev et al. 



Proteomics 2005, 5, 3066-3078 



Table 3. Continued 
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Accession 
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Eukaryotic translation initiation factor 2, subunit 2 
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IMP (inosine monophosphate) dehydrogenase 2 
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Interferon induced transmembrane protein 1(9- 
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P39030 
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Ribosomal protein L8 


























L38941 


P49207 


RS5_HUMAN 
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Table 4. Results of transcriptomics research in human CRC. Most cited down-regulated genes out of 12 genome-wide or large-scale studies 
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L-FABP liver fatty acid-binding protein 1 
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Acyl-CoenzymeA dehydrogenase C-2 to C-3 short chain 
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IgG Fc-binding protein i 


1 








1 
















ITMC_HUMAN 


Integral membrane protein 2C ; 










1 


1 














MK03_HUMAN 


MAPK3 
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CA12 
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Calcium dependent protease (small subunit) 
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Table 4. Continued 
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Second, results have been obtained using heterogeneous 
samples in particular cell lines, whole tissue biopsies, and 
epithelial cells purified from surgical specimens. Results 
obtained in cell lines do not allow accurate comparison be- 
tween normal and cancer cells, and the presence/absence of 
proteins of interest has to be confirmed in biopsies. For 
example, when 2-D PAGE protein patterns of normal human 



colonic crypts were compared with the CRC cell line LIM 
1863, the proteins spots from normal crypts matched only 
75-80% of the cell line spots and the relative expression 
levels of a large number of proteins differed [3], When clin- 
ical biopsies are examined, results of phenotypic compar- 
isons depend on the type of samples examined (e.g., hetero- 
geneous whole tissue biopsies with inflammatory cells, 
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Table 5. Results of proteomics research in human CRC. Most cited up-regulated (orange), down-regulated (green), or mentioned (yellow) 
proteins out of 12 large-scale proteomics studies 



Accession 
Number 



Swiss-Prot Entry Name 



Name 



AF041260 j075363 



ACTB_HUMAN mRIMA fragment encoding cytoplasmic actin 



AA232508 S Q9Y397 



AA007218 IQ96QY8 



ACTG.HUMAN 



Actin, Cytoplasmatic 2 (Gamma-Actin) 



GTP_HUMAN 



Glutathione S-transferase M3 (brain) 




1G01119 



THIOJIUMAN 



Thioredoxin (ATL-derived factor) (ADF) 



X75821 



Q04984 



U22055 



U14631 



M92439 
AF029082 



M86400 



L76465 



X57352 



X03205 



ANX3_HUMAN 



Q96AG0 



ANX4_HUMAN 



Annexin lll (lipocortin III) 
Annexin IV (lipocortin IV) 



P80365 



K1CR.HUMAN 



Keratin,type I cytoskeletal 18 (cytokeratin 18) (K18) (CK18) 



P42704 



ATPB_HUMAN 



ATP5B A TP synthase 



P31947 



PPIA.HUMAN 



Peptidyl-Prolyl Cis-Trans Isomerase A (Cyclophilin A) 



P29312 



PDA3_HUMAN 



Probable protein disulfide isomerase ER-60 precursor 



P15428 



TPIS HUMAN 



Triosephosphate 's^eja^eJTtM) 



Q01628 



TCTP_HUMAN 



translationally controlled tumor protein (TCTP)(p23) 



P 10809 



CH60_HUMAN 



Heat shock protein 60 



M95787_ 

Z25821 

U96132 



Q01995 



P4212G 



Q99714 



AB028893 



P04643 



X79239 



Q02546 



Ml 3934 



P06366 



M60854 



M13932 



VINC_HUMAN 



Vinculin 



EFlGJHUMAIM 



TBAl.HUMAN 



P17008 



PO8708 



Elongation factor 1-gamma mRIMA 



Tubulin alpha-1 chain, brain specific 



CATD_HUMAN 



EF1 INHUMAN 



G3P2.HUMAN 



GR78_HUMAN 



EN0AJIUMAN 



CRTC.HUMAN 



Cathepsin D (EC 3A23^5}J3E^ 



Elongation factor 1 -alpha, mRNA 



Glyceraldehyde 3 phosphate dehydrogenase 



78 kDa glucose-regulated Pf^ ei ££ r ^^[£°i.l^iL I LZ?l 



Alpha enolase (2-phospho-D-glycerate hydrolyase) 



Calrecticulum precursor (CR55) (calregulin) 



L06498 



P17075 



C0F1_HUMAN 



Cofilin, non-muscle isoform 



L04483 



P35265 



HS7C_HUMAN 



M64716 



P25111 



HBB_HUMAN 



X77770 



P02383 



K2C8_HUMAN 



W52460 
•X55715 



P14798 



P23396 



M77234 m 
M58458" 



P49241 



PI 2750 



D 16992 
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M77233 



P23821 



F16294 



P09058 



X61156 



P08865 



L36055 



Q13541 



RM12_HUMAN 



GR75_HUMAN 



HeaTshock Cognate 71 kDa protein 



Hemoglobin beta chain.(GENE: HBB}h.s 



Keratin, type II cytoskeletal£jcjrtoJ^ejratin8) JK 8)J_CKJJ 



Mitochondrial 60s ribosomal protein L7/L12 precursor 



Mitochondrial stress-70 protein precursor (GRP 75) 



SODCJWMAN 



KCRB„HUMAN 



Superoxide Dismut^se ICu_*Zn] 
Creatine kinase 



143Z_HUMAN 



RL4_HUMAN 



ANX1.HUMAN 



R0A1_HUMAN 



FAB L_HU MAIM 



1 4-3-3 protein zeta/delta (protein kinase C inhibitor protein-1) 



60s ribosomal protein L4JLT_J_ 



AnnexinT(lipocortin 1) (calpactinll) (chromobindin 9)(p35) 



DNA binding protein UPi, liver mRNA fragment 



L-FABP liver fatty acid- binding protein J! 
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Table 5. Continued 
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S-Adenosylhomocysteine hydrolase AHCY} 
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X64707 


P26373 


TERA_HUMAN . 


Transitional endoplasmic reticulum ATPase (TER ATPase) 
















X56932 


P40429 


GDIR„HUMAN 


Rho GDP-dissociation inhibitor 1 


















L25899 


P39030 


A r^AIM IIIIRAAM 

AC0N_HUMAN 


Aconitate hydratase, mitochondrial precursor (aconitase} 
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M 
















X89401 | 


P46778 


A A r> A 1 11 IftA A M 

AAC4_HUMAN 
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Table 6. Reproducibility of transcriptome results using proteomics tools (a) and Wee versa (b). Proteomics results have been reproduced in 
67% of cases in transcriptomics studies. Results from transcriptomics studies are more difficult to reproduce using proteomics 
tools (25%). Only two gene products can be retrieved in three and more proteomics a nd transcriptomics studies (actin andcreatin- 
kinase) 



a. Transcriptomics > Proteomics 
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Table 6. Continued 
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necrosis, blood, stool, etc.), so that standardized sample 
preparation procedures are critical for obtaining reproduc- 
ible results. Several sample preparation methods have been 
described, in particular fluorescence-activated cell-sorting 
(FACS) |4], laser capture microdissection (LCM) [5-8], 
immunomagnetic beads separation (9], and cellular frac- 
tionation [10 J. Unfortunately, these sample preparation 
procedures were barely applied in CRC. In our experience, 
the beads-based method is characterized by several advan- 
tages when compared with other cell purification proce- 
dures [11]. 

Third, gene expression patterns depend on .the arrays 
technology platform. In transcriptomics studies any factors 
may affect the outcome of a microarray experiment, in par- 
ticular technical, instrumental, computational, and inter- 
pretative factors. In fact, lack of reproducibility and accuracy 
is a major concern in microarray studies [12]. When cross- 
platform comparison was performed, reproducibility was 
insufficient: only four genes from a set of 185 common genes 
selected behaved consistently on three array platforms, and 
agreement of about 30% was found between two brands [1 3], 

Fourth, in proteomics studies, 2-D PAGE or 2-D DIGE 
have well-known technological limitations. In CRC, even 
after epithelial cell enrichment using magnetic beads, the 
mean CV of repeated 2-D PAGE analysis with silver staining 
was found to lie between 20 and 28%. Only 47% (interrun) to 
76% (intrarun) of spots could be matched within a triplicate 
experiment. Interindividual phenotypic variability was high. 
Thus, even under weD-defined experimental conditions, 2-D 
PAGE parallel analysis of paired CRC samples is hampered 
by a significant variability [2]. 

Fifth, the methods applied for generating, formatting, 
storing, retrieving, and querying data are of outmost impor- 
tance to assess methodological and biological variation in 
gene expression analysis. Unfortunately, due to the small 
sample size (number of patients), large number of variables 
examined at once, and absence of double or triple experi- 
ments (arrays and gels are expensive and samples are rare) 
statistical analysis is often not valid. In particular, assessing 
the reproducibility of a variable is necessary (e.g., using the 
intraclass correlation coefficient) for comparing multiple 
samples at once. The use of median values instead of mean 
values has been shown to improve data correction [14]. It has 
also been proposed to use housekeeping genes as endoge-j 
nous controls [15]. A dedicated society, The Microarray Gene! 
Expression Data (MG ED) Society, has been formed to facil-; 
itate the sharing of gene expression data generated by func- 
tional genomics and proteomics experiments [16]. 

Finally, correlation between results of transcriptomics 
versus proteomics results is low. For CRC, there is no pub- 
lication comparing mRNA and protein expression for a 
cohort of genes. However, extrapolation is reasonable from 
another epithelial cancer (lung adenocarcinoma), where 
such comparison has been performed. Only a subset of the 
proteins (17%) exhibited a significant correlation with 
mRNA abundance [17]. 
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Obviously, many genes and factors found to be differen- 
tially regulated (both in transcriptomics and proteomics 
studies) do not play a causal role in CRC carcinogenesis. For 
example in the studies under investigation, at least 17 
mRNAs encoding ribosomal proteins were identified to be 
dysregulated using cDNA arrays and 39 ESTs using SAGE 
technology. This broad dysfunction of protein synthesis, in 
particular of small molecules synthesis, has been reported 
not only in cancer but also in several other human diseases, 
where etiologies have been linked to mutations in genes of 
the translational control machinery [36]. 

However, some findings might be of particular interest 
in human CRC. For example, a small molecule group found 
to be dysregulated in human CRC is the 14-3-3 proteins 
family. 14-3-3 proteins are ubiquitous within all eukaryotic 
cells and participate in protein kinase signaling pathways. In 
particular, they are involved in phosphorylation-dependent 
protein-protein interactions that control progression 
through the cell cycle, initiation and maintenance of DNA 
damage checkpoints, activation of MAP kinases, prevention 
of apoptosis, and coordination of integrin signaling and 
cytoskeletal dynamics [37]. Given the prevalence of specific 
14-3-3 isoforms expression in several human epithelial can- 
cers [38, 39], these proteins may be involved in cancer 
tumorigenesis and particular isoforms may be useful as 
therapeutic targets in human CRC. 

In summary, we propose a gene expression data ware- 
house in human CRC that is intended to help researchers 
active in the field to get an overview of the data available. 
However, reproducibility of results obtained in different 
studies was disappointing. 

As a matter of fact, the development of some types of 
unified processes for generating information, formatting, 
storing, retrieving, and querying data, regardless of the tech- 
nology used to generate it, is of central importance for 
building gene expression databases. Such unified processes 
have been proposed (see above) but have found only limited 
recognition so far. This is unfortunate because these metho- 
dological issues must be solved before trying to integrate 
experimental data from different sources into functional 
proteomics studies at the bench or in silico. The present 
meta-analysis of gene expression highlights the need for 
including a sufficient number of patients, analyzing only 
purified epithelial cells, using clinical standards (such as the 
international classification of disease, histopathology, and 
staging), assessing the variability of the technology applied, 
considering common standards for microarray data annota- 
tion and exchange, and finally for developing software 
implementing these standards and promoting the sharing of 
these high quality, well -annotated data within the life sci- 
ences community. 

In the absence of such unified processes, reproducibility of 
results within a laboratory will continue to be low, confirma- 
tion of results between different groups will stay difficult, and 
identification of diagnostic and therapeutic targets will remain 
a lottery. These problems are challenging the whole drug dis- 
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covery process from the very beginning, namely target identi- 
fication, so that target validation, assay development, and 
prioritization of compounds remains a high risk endeavor. 

We are very grateful to G. Nestlerfor her excellent technical 
assistance. 
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17.1 Introduction 

There are now numerous organisms whose genome sequences are known, for 
example Arabidopsis (Arabidopsis Genome Initiative, 2000), Drosophila (Adams 
et ai, 2000), human (International Human Genome Sequencing Consortium, 
2001) and rice (Yu et aL, 2002). The prediction of open-reading frames from 
these genomic sequences has enabled the comprehensive identification of many 
putative protein sequences. These proteins can be arranged into three categories, 
namely those of known function, those with recognisable motifs and hence a 
vague idea of function, and those with no sequence similarities to any pro- 
tein (Gabor Miklos and Malenszka, 2001). Many proteins reside in this latter 
'functional vacuum' which could represent as much as 30% of the predicted 
proteins. Determining protein function is key to understanding cellular mecha- 
nism. Studying how protein expression is modulated in response to a given set 
of circumstances, such as infection, disease, developmental stage, senescence or 
response to drugs, will facilitate the elucidation of disease pathways and thus 
provide a mechanism for diagnosis and therapy. 

DNA chips (mRNA profiling studies) can contribute to the study of gene 
expression in response to a particular biological perturbation. However, the 
extrapolation that changes in transjpript level will also result in corresponding 
changes in protein amount or activity cannot always be made. To understand 
fully, we need integrated data sets fro,m a variety of protein expression studies, 
providing information on relative abundances, sub-cellular locations, protein 
complex formation and the profiling of isoforms generated by either alternate 
mRNA splicing or post-translational modifications. Proteomics is the word now 
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commonly used to describe the discipline associated with the acquisition of 
these data sets. 

17.2 Definitions and applications 

Before embarking on an overview of the techniques used in the field of pro- 
teomics, it is necessary to define a few terminologies; proteomics, functional 
genomics, structural genomics and post-genomics are terms that have crept into 
the scientific vocabulary with alarming stealth and are used freely, and in some 
cases interchangeably, especially in cases where the use of such 'buzz words' 
is likely to increase the attendance at departmental seminars. 

• Genome: A genome represents the entire DNA content in a particular cell, 
whether or not it is coding, non-coding, or is located either chromosomally 
or extra-chromosomally. 

• Genomics: Genomics is the study of the genome, interrogating the complete 
genome sequence using both DNA and RNA methodologies. 

• Proteome: The proteome represents the complete set of proteins encoded by 
the genome. 

• Proteomics: Proteomics is the study of the proteome and investigates the cel- 
lular levels of all the isoforms and post-translational modifications of proteins 
that are encoded by the genome of that cell under a given set of circumstances. 

• Functional genomics: This is the study of the functions of genes and their 
inter-relationships. 

Whilst a genome is more or less static, the protein levels in a particular cell can 
change dramatically as genes get turned on and off during the cell's response to 
its environment. The proteome originally was defined over seven years ago as 
'all the proteins coded by the genome of an organism' (Wasinger et al. y 1995). 
Nowadays the term 'proteomics' is used more widely and implies an effort to 
link structure to function by whatever means are appropriate. We can expect the 
definition to change again with time as the field and the investigator's view of 
it evolves. 

17.3 Stages in proteome analysis 

Proteomic analysis can ascertain function either by looking for changes in the 
expression of either all or a subset of proteins, or by identifying binding partners 
for particular proteins and seeing how their interaction is affected by biological 
perturbation. Whatever the rationale of the investigation, or the number of pro- 
teins involved, the study of the proteome can be broken down into the following 
stages of analysis. : 

v 
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1. Separation of proteins. Before analysing protein expression and abundance 
levels, proteins first have to be isolated into a 'purified' state. Whilst there 
are a variety of chromatographic procedures that can achieve this, separation 
by two-dimensional polyacrylamide gel electrophoresis (2D PAGE) has been 
the method of choice in the recent past. However, other new methodologies 
are now emerging, each having its own strengths and weaknesses. 

2. Analysis of comparative expression. Once separated, it is then necessary 
to carry out some form of analysis to assess the relative abundance of the 
proteins present. 

3. Identification of protein species. Once a set of proteins showing differences 
in abundance between two or more states has been identified, mass spectro- 
metric analysis is be used to determine their identities. 

4. Confirmatory experiments. 

When a protein has been shown to be important in a given process by the 
above analysis, it may be necessary to perform further experiments to confirm 
its implied function or involvement in the process. 

Protein Separation and Visualisation 

For proteomes that encompass the protein content of a given cell or tissue type, 
or that of a whole organism, there are two main methods that are first used to 
resolve the protein mixture, and then to visualise the individual components in 
such a way that their relative abundances can be quantified. The first method 
utilises two-dimensional (2D) gel electrophoresis followed by a variety of in-gel 
staining methods, whilst the second couples liquid chromatographic separation 
to subsequent ultraviolet (UV) and/or mass spectrometry detection, 

2D PAGE 

The majority of proteome analysis to date has employed 2D PAGE, a technique 
that has been around for many years (O'Farrell, 1975) and is the subject of 
several excellent reviews (Gorg et aU 2000). Using this method, proteins are 
initially separated by isoelectric focusing (IEF) in- the first dimension, accord- 
ing to charge (pi), and then by SDS PAGE in the second dimension, according 
to size (Af r ). This type of separation has the capacity to resolve complex pro- 
tein mixtures, thus permitting the simultaneous analysis of hundreds or even 
thousands of proteins at a time (Figure 17.1). 

One of the most important considerations to make when visualising, quan- 
tifying and then determining the identities of spots from 2D gels is which 
staining method to use for spot detection. There is a variety of staining meth- 
ods available which differ in their sensitivity and dynamic range of detection. 
Coomassie staining (Neuhoff et al, 1988) is compatible for mass spectrometry 
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Figure 17.1 The process operates by the following principle: First dimension - isoelectric 
focusing. Proteins separate on the basis of their charge to a position along the strip where 
their pi and pH at that position are equivalent and they have no net charge. Second dimen- 
sion - SDS polyacrylamide gel electrophoresis. Proteins denature in SDS and their migration 
through the gel matrix is inversely proportional to their mass 



identification purposes, but the sensitivity of detection only goes down to the 
sub-microgram range. Silver staining (Mortz et al. 9 2001) has been widely used 
for high sensitivity protein visualisation on 2D gels, detecting sub-nanogram 
amounts of material. However, the drawbacks of silver staining pertain to its 
poor dynamic range of staining which renders it unsuitable for quantitative anal- 
ysis. More recently, the Sypro post-electrophoretic fluorescent stains (Molecular 
Probes, Oregon, USA) have emerged as alternatives, offering a better dynamic 
range of detection and ease of use (Malone et al. y 2001 ; Yan et al. t 2000). Sypro 
Ruby has been shown to be more sensitive, as well as compatible for subsequent 
peptide mass mapping, than silver staining (Lopez et al, 2000). 

2D PAGE is not without numerous technical difficulties and inadequacies. 
Most problems are concerned with the incomplete coverage of the proteome, 
where proteins with either high molecular weights, or extremes of pi or hydro- 
phobicity, may not be represented on gels. Another problem regards the degree 
of protein separation achieved by 2D gels. Ideally, every protein in a sam- 
ple would be resolved as a discrete, detectable spot by 2D PAGE. However, 
since complex samples can be separated by 2D electrophoresis into as many 
as 3000 spots, a large proportion of proteins will co-migrate to the same spot 
position, a property that will subsequently confound quantitation and mass spec- 
trometric identification (Gygi et a/ M 2000). Additionally, since about 90% of 
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the total protein of a typical cell is made up of only 10% of the 10 000-20000 
different protein species (Gabor Miklos and Malenszka, 2001; Zuo et aL, 2001), 
the less abundant protein species may be, obscured by more abundant ones, thus 
further compounding the proteome coverage problem. 

There are strategies designed to overcome the aforementioned limitations of 
2D PAGE. For instance, in order to increase . the representation of hydrophobic 
proteins, which are usually membrane or membrane-associated proteins, alter- 
native denaturants or detergents, such as thiourea and ASB14 respectively, can 
be employed (Molloy et aL, 2000; Santoni, et al, 2000). However, although 
these methods increase the number of hydrophobic proteins represented, protein 
coverage is still by no means complete. 

The loss of low abundance proteins from 2D gels is a problem caused by a 
combination of limited resolution and sensitivity. These proteins are quite often 
the most biologically active molecules and if their detection is not masked by 
a more abundant protein as described above, it may be limited by their scarcity 
in the starting sample. Zuo et al (2001) estimated that there may need to be 
at least 1000 copies of a protein within a cell to be detected. Therefore, to 
increase the representation of low abundance proteins, they may need to be 
enriched by fractionation of the stalling sample. Pre-fractionation of a complex 
protein sample enables an increase in the amount of sample that can be loaded 
onto a 2D gel, subsequently increasing the sensitivity of spot detection. Pre- 
fractionation also enables samples to be simplified into different components 
which can then each be analysed using separate gels, thus facilitating the ability 
to achieve complete protein coverage. 

One way to fractionate indirectly is to focus greater amounts of sample to 
a higher resolution on narrow pH range IPG strips such that they may have a 
greater chance of migrating separately from abundant protein species. One set 
of researchers has reported displaying 70% of the entire Escherichia coli pro- 
teome using a combination of six different pH range gels (Tonella et a/., 2001). 
Most fractionation protocols utilise chromatographic, differential extraction or 
centrifugation principles. Using liquid chromatography, proteins can be sepa- 
rated upon the basis of their hydrophobicity, native size or charge. Additionally, 
extraction, either chemically using alternative detergents, or manually, by dis- 
section of tissues of interest, can also help to limit the number of proteins in a 
given sample. Differential centrifugation can be used to isolate or enrich a par- 
ticular tissue or organelle fraction, this having the added functional advantage 
of being able to simultaneously isolate groups of proteins that are involved in 
a common physiological role, for example Arabidopsis mitochondrial proteome 
(Millar et a/., 2001) and Arabidopsis chloroplast (Schubert et aL, 2002). 

Non-gel-based methods 

Although optimisation of 2D PAGE strategies have meant that more of the 
proteome can be visualised using this method, problems detecting low abundance 
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species, and particularly integral membrane proteins, have prompted several 
research groups to introduce gel electrophoresis-free approaches. Recently, sev- 
eral researchers have described methods of protein separation and identification 
that utilise on-line, 2D high performance Liquid chromatography (HPLC) sepa- 
rations (Link et aL, 1999; Wolters et al. y 2001). Wolters et aL, who have named 
the technique multidimensional protein identification technology (MudPIT), use 
a capillary HPLC column with a strong cation exchange matrix at the front end 
of the column which is followed by reverse phase packing at the back end. 
Their approach involves the tryptic digestion of soluble and insoluble protein 
fractions of the entire yeast proteome, followed by the application of total tryp- 
tic peptides from the two fractions onto the strong cation exchange matrix at 
the top of the column. A salt step gradient is then used to displace a fraction 
of the peptides onto the reverse phase packing. Displaced peptides are then 
eluted into the mass spectrometer using a solvent gradient. This procedure is 
then repeated in steps, each time using an increasing amount of salt to release 
further peptides from the cation exchange to the reverse phase packing. Each 
peptide eluted is introduced into a mass spectrometer capable of generating frag- 
mentation data, which in turn are used for automated searches against protein 
databases and identification. Mass spectrometric techniques are dealt with in 
more detail below. 

The application of this approach to the total yeast proteome is impressive, 
enabling the identification of approximately 1500 proteins (Washburn et a/., 
2001), in comparison with 2D PAGE based methods which have only identified 
around 400 spots (corresponding to about 300 gene products). One advantage of 
MudPIT is that there are many more protein classes represented in its proteome 
than are produced from the electrophoresis-derived proteome. It therefore has 
the potential to provide more information about proteins that do not behave well 
on 2D PAGE, and also to generate quantitative data when coupled with stable 
isotope labeling (see below). 

Quantitation 

Qualitative proteomics enables the investigator to determine whether or not a 
particular protein shows an increase or decrease in expression. As this pro- 
vides no measure of the extent of this expression change, this approach is 
therefore unsuitable for clustered data analysis which ultimately presents an 
insight into functionality. On the other hand, quantitative proteomics does allow 
co-expression patterns to be studied, and proteins showing similar expression 
trends can then be assigned into the same functional groups. An example of this 
was demonstrated by Grunenf elder et al. (2001) who discovered that expression 
changes during the cell cycle of Caulobacter can be grouped into 23 distinct 
pattern clusters. Proteins within each cluster have been shown, in many cases, 
to have a similar function. To generate quantitative data, the authors used 
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radioactive labelling of proteins in conjunction with phosphorimaging of 2D 
gels. One drawback of this method is that the labelling is not as straightforward 
as that used in alternative methods. One alternative, a non-gel-based method 
for quantitation, involves mass spectrometry utilising differential isotope coded 
affinity labeling (see below). 

In the case of quantitative analysis, using 2D PAGE, this has involved running 
the two test samples to be compared on separate 2D gels, and then visualising the 
spots with silver stain. Image analysis software has been used to estimate spot 
volumes in each gel, which are then expressed as a ratio that provides a measure 
of any change in expression. However, quantitation using this type of analysis 
is crude as this technique has a number of inherent drawbacks. First, as no two 
gels run identically, there are problems with irreproducibility. Additionally, the 
quantitative process is complicated by the fact that corresponding spots between 
gels have to be matched prior to quantitation. However, spot matching can be 
performed by warping algorithms which are built into most gel analysis software. 
Silver staining, as stated earlier, has a very poor dynamic range which reaches 
saturation in the low nanogram range, thus rendering it unsuitable for accurate 
quantitation. These aforementioned factors all add variability into the system 
that makes this method unsuitable for the accurate quantitation of differences 
between two test samples. 

Difference gel electrophoresis (DIGE), first described some time ago, has 
the potential to overcome many of the issues described above (Unlu et a/., 
1997). The technique relies on pre-electrophoretic labelling of samples with 
one of three spectrally distinct fluorescent dyes, cyanine-2 (Cy2), cyanine-3 
(Cy3) or cyanine-5 (Cy5). The samples are all run in one gel and then viewed 
individually by scanning the gel at different wavelengths, thus circumventing 
problems with spot matching between gels. Image analysis programs can then 
be used to generate volume ratios for each spot, which essentially describe the 
intensity of a particular spot in each test sample, and thus enable expression 
differences to be identified and quantified (Figure 17.2). 

Fluorescent labels bind to lysine residues and labelling is carried out at sto- 
ichiometrics such that only a small proportion of the protein is labelled. These 
labelled proteins are compatible with in-gel digestion and mass spectrometric 
analysis. This method is more sensitive than staining with silver or any Sypro 
dye, with a detection limit of somewhere in the region of 100-200 pg protein 
and a dynamic range of labelling of over 5 orders of magnitude (Kernec et al. t 

2001 ; Tonge et ai 9 2001). j : 

'/ 

Identification 

Since the early 1990s, mass spectrometry has evolved into an extremely pow- 
erful technique for the identification of proteins from 2D gels. The application 
s of two ionisation techniques to the analysis of peptides and proteins were partly 1 
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Figure 17.2 Schema for labelling of two samples whose protein profiles are to be corn- 
pared by the 2D DIGE technique. The images are from a 2D analytical gel (pH 4-7) which 
was loaded with 50 u.g of a total protein extract from a wild-type strain of Erwinia caro- 
tovorea labelled with Cy3 and 50 u,g of total protein extract from a mutant strain of Erwinia 
carotovorea labelled with Cy5. The images were acquired using a 2920-2D Master Imager 
(Amersham Biosciences). The images were exported as 16-bit TIF images for analysis. To 
produce this figure, 8-bit TIF versions of the images were imported into ADO^E Photoshop 
version 5 and false coloured, green for Cy3, red for Cy5 and the two imaged overlaid. A 
colour version of this figure appears in the colour plate section 
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responsible for the surge of interest in biological mass spectrometry. Matrix- 
assisted laser desorption/ionisation (MALDI) and electrospray ionisation (ESI) 
were developed by Karas and Hillenkamp (1988) and Fenn et al (1989), respec- 
tively. The combination of either of the above mass spectrometric techniques 
with the separation of proteins by 2D-PAGE is now an established method for 
proteome analysis. In both cases,. identification takes place at the peptide level, 
not the entire protein. It is therefore necessary to convert proteins in excised gel 
pieces into peptides which can be extracted for analysis. The preparation of pep- 
tides ideally is performed using a fully automated digestion robot. This primarily 
reduces preparation time, but also prevents contamination by keratins, of which 
there are many sources ranging from hair, skin, dust and clothing. Excised spots 
must be destained, depending on the visualisation method used, reduced and 
alkylated to prevent inter-peptide disulphide bridge formation which could com- 
plicate analysis, and finally digested into relatively short peptides using a robust 
protease such as trypsin. Peptides generated are then extracted in an appropriate 
solvent compatible with the mass spectrometric technique to be used. 

MALDI -TOF mass spectrometry (MALDI) 

Peptides can be analysed by matrix assisted laser desorption ionisation (MALDI) 
to produce peptide mass fingerprints which are then matched against protein 
databases in order to identify the corresponding proteins (Henzel et al, 1993; 
Mann et al, 1993; Pappin et al, 1993; Yates et al, 1993). In addition to peptide 
identification, MALDI may also be used to identify which peptides in a tryptic 
digest have undergone post-translational modifications such as phosphorylation 
and glycosylation, which are mediated by kinases (Liao et al, 1994; Zhang 
et al, 1998) and glycosyl transferases (Colangelo and Orlando, 1999; Mechref 
and Novotny, 1998). 

MALDI may be divided into three separate stages: ionisation, mass separation 
and detection. One of the main advantages of using MALDI is that very little 
internal energy is imparted into the ions during ionisation, thus resulting in 
minimal fragmentation and therefore the formation of intact adduct ions. These 
are of the type (M + H) + , where M = the biological molecule and H = H + (or 
a proton). MALDI is therefore classified as a * soft' ionisation technique. 

Ions are generated through the use of a laser (usually nitrogen) which is 
fired at a sample plate containing a dried mixture of matrix and sub-picomole 
quantities of sample. The matrix, constituted by small organic molecules such 
as a-cyano-4-hydroxycinnamic acid, absorbs radiation from the laser (337 nm 
for nitrogen lasers) which results in the excitation of the matrix molecules. A 
dense plume containing matrix and analyte molecules is then produced, and 
the analyte molecules interact with hydrogen atoms from the matrix to form 
predominantly singly* charged (M + H) + ions. These ions are then separated in 
the mass analyser, with lower mass ions travelling faster than high mass ions at 
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constant energy, until they are detected by a microchannel plate detector (MCP). 
All proteomic experiments are performed using time-lag-focusing and reflectron 
mode, both of which act as energy focusing devices that greatly improve the 
resolution oHhe ions. Although the mass range of a time of flight (TOF) analyser 
is theoretically infinite, it does have an upper mass range of around 750 kDa in 
linear mode and 100 kDa in reflection mode. 

A typical MALDI mass spectrum of a typical protein digest mixture obtained 
from a 2D gel spot is shown in Figure 17.3. The acquisition and processing 
of MALDI data is automated and includes background subtraction, smooth- 
ing, centroiding and the generation of text files. The masses of the centroided 
peaks (minus contaminant peaks such as keratins) are then entered into a search 
engine. This program compares the observed, calculated masses against the the- 
oretical calculated masses of all the possible tryptic digest fragments from all 
proteins in non-redundant protein databases. The main parameters required for 
the peptide fingerprint search are the database to be searched (e.g. NCBI), type 
of enzyme used to digest the protein (e.g. trypsin), modificationsof the peptides 
(e.g. carbamidomethylation of cysteine residues) and mass tolerance (usually 
within 100 ppm for MALDI data). An output file is then generated which dis- 
plays hits, the scores of which are calculated by means of a probability-based 
scoring algorithm. 

Although MALDI has proved to be an impressive and rapid method for 
protein identification, it is limited by the fact that a protein sequence searched 
against may not be represented in the database. Also, MALDI does not cope 
well with samples that constitute a mixture of proteins and therefore result in 
heterogeneous digestion products. An alternative identification method involves 
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Figure 17.3 MALDI mass spectrum of the digest mixture of peptides obtained from a 
Drosophila 2D gel spot 
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peptide sequencing using nanospray/LC-MS/MS. This has the advantage over 
MALDI in that it is more sensitive, fully automated, and can also make positive 
identifications from mixtures of peptides. However, the combination of mass 
mapping and sequencing has allowed protein identification to be : both rapid 
(MALDI -TOF) and specific (LC-MS/MS). 

Nanospray ionization mass spectrometry (nano-ES) 

An alternative mass spectrometric-based method for protein identification is 
nanoelectrospray (nanoES). This utilises very fine glass capillaries with gold- 
coated tips in order to produce a spray (Wilm and Mann, 1996) which subse- 
quently enables extremely low levels (<50 fmoles of total protein) of protein 
digest mixtures to be analysed at flow rates of approximately 20-50 nLmin" 1 . 
For nanoES sample preparation, samples are dissolved in a solvent mixture 
(usually 50 : 50 acetonitrile : water, or 50 : 50 methanol : water containing 0.1% 
formic acid) and are then injected into the gold-coated capillary. Usually, sam- 
ples contain salts and buffers from the digestion process which must be removed 
prior to electrospray ionisation (ESI) analysis to prevent contaminant peaks and 
poor peptide signals. Once loaded, the capillary is then held at a potential of 
M -1 .5 kV which results in the formation of a very fine spray of solvent droplets 
containing preformed ions. The different types of ions formed are (M + rcH)" + , 
where M is the peptide molecule, nH is the number of protons attached to the 
molecule, and n+ is the net charge of the peptide ion (usually 2-4). Multiply- 
charged gas-phase ions are then formed as a result of desorption processes that 
occur due to evaporation of the solvent droplets. 

ESI differs from MALDI in that a series of ions are formed that have differ- 
ent numbers of charges attached (i.e. (M + nH)" + ). The resulting mass spectra 
obtained therefore display several peaks that correspond to ions which have the 
same peptide or protein sequence, but different numbers of attached charges. 
As mass spectrometry is concerned with the measurement of mass-to-charge 
(m/z) ratios, rather then mass alone, ESI-MS allows the analysis of relatively 
high molecular weight samples whilst using analysers with only modest m/z 
ranges. In general, the types of mass analysers used in conjunction with ESI 
are quadrupoles, quadrupole ion traps, quadmpole time-of-flight (Q-TOF) hybrid 
instruments and time-of-flight. The advantage of these instruments over MALDI 
is that the fragmentation of the ions can be very carefully controlled using tan- 
dem mass spectrometry (MS/MS), thus leading to precise sequence information. 
In addition, these types of instruments also have the ability to fragment very 
low abundance ions. 

The fragmentation of ions results from collision-induced dissociation (CID) 
processes in which low-energy collisions occur within a collision cell between 
isolated ions of a specific m/z and an inert gas, usually jargon. The fragment 
ions are then separated and detected to give a fragment ion spectrum (MS/MS 
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Figure 17.4 ESI-MS/MS spectrum of a doubly charged ion (m/z 523.29) of a trypsin 
autolysis product from porcine trypsin. Subtraction of the masses of adjacent fragment ion 
peaks (/7-type) corresponds to the masses of the amino acids in the peptide chain. Hence, the 
complete sequence of the peptide is LSSPATLNSR 

spectrum; Figure 17.4). The manually interpreted sequences and m/z values of 
the intact peptides, or uninterpreted data in the form of a text file (containing 
the m/z values and relative abundances of each peak in the MS/MS spectra) are 
then submitted to a search engine and hits are obtained based on the sequence 
information along with the masses of the peptides. Usually, the scores of the 
identified proteins are much higher than the hits obtained from MALDI data 
because the information submitted is much more specific; i.e. m/z values and 
sequence information for ESI-MS, as opposed to only m/z values for MALDI. 

In addition to nanoES, nano-liquid chromatography mass spectrometry 
(nanoLC-MS/MS) can also be used for sequencing. In this technique, peptides 
are separated by reverse phase in a narrow capillary column (i.d. of 50-180 |xM) 
that is packed with Ci 8 material. The advantage of this system over nanospray is 
that the analysis is fully automated. Up to 96 samples can be loaded and injected 
onto the system, mass spectra and MS/MS spectra acquired and processed, and 
the resulting data searched against databases without any manual intervention. 
As with MALDI-TOF, ESI-MS/MS is a powerful tool for the determination 
of post-U-anslational modifications. The latter technique can be considered more 
powerful as, in addition to the type of modification, the exact site of the 
modification can be determined. 

Comparative analysis of the proteome using stable isotope labelling 

The section 'Quantitation' above dealt with quantitative proteomics using differ- 
ential labelling in conjunction with 2D PAGE. However, as discussed earlier, a 
sizeable proportion of proteins cannot be detected using this technology largely 
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because of solubility and abundance issues. Although the MudPIT approach 
provides a mechanism to interrogate proteins recalcitrant to gel electrophoresis- 
based methods, it does not yield information about the relative quantities of 
the identified proteins. One technique which results in the measurement of 
relative abundances is mass coded abundance tagging (MCAT), where whole 
samples are digested to peptides,, and the . terminal lysine residues of peptides 
are converted to honioarginine using O-methylisourea. The relative abundance 
of proteins between two samples can be measured if peptides from one sam- 
ple only are labelled and are then combined with unmodified peptides prepared 
from the comparative sample. Peptide masses, determined by MS, which dif- 
fer by 42 atomic mass units (amus) correspond to the modified and unmodified 
forms of a peptide present in both samples. Relative peak intensities then give 
information about the relative abundance of the parent protein in the original 
samples (Cagney and Emili, 2002). 

A more robust methodology than MCAT to look at quantitative differences 
in protein abundance between sets of samples employs the use of stable isotope 
labelling. Here, a protein sample is prepared such that it contains a particu- 
lar isotopic form of an amino acid or added label, with a comparative sample 
similarly prepared, but with an alternative isotope. The samples can then be com- 
bined and digested to peptides. The peptides are then separated and sequenced, 
and the relative distribution of the isotopes assessed by nanoLC-MS/MS. One 
approach is to use the incorporation of isotopic variants of amino acids into 
protein samples, although this approach is only suitable in the case of. cultured 
material (Pratt et al., 2002). A more universally applicable method is that of 
isotope coded affinity tagging (ICAT; Gygi et aL, 1999). This involves the use 
of two probes, each comprising a biotin tag, a linker region and a cysteine reac- 
tive iodoacetamide handle. Although the two probes employed have identical 
chemical properties, they differ by a mass of 8 Da in their linker region, which 
can therefore be either a light version (8 hydrogen atoms) or a heavy version 
(8 deuterium atoms). 

When comparative analysis of two samples is made, one sample is labelled 
with the light probe whilst the other is labelled with the heavy probe. The 
two extracts are then pooled and digested with trypsin. The digest is loaded 
onto a monomeric avidin affinity column which retains the labelled, cysteine- 
containing peptides. These are subsequently eluted from the avidin by formic 
acid. The eluate, which consists if peptide pairs (heavy and light), is then 
applied to nanoLC-MS/MS. At the. first MS stage, each peptide appears as 
a mass pair separated by 8 Da (provided there is a single cysteine per pep- 
tide). Since the starting amount of each peptide pair should be equivalent, as 
assessed by the mass signal intensity of each, a ratio of these two intensities 
will enable the identification of peptides that show expression differences. Sub- 
sequent fragmentation and MS/MS peptide identification leads to the name of 
: the corresponding protein. 
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Since cysteine content differs widely from one protein to another, some 
proteins will therefore be accurately assigned by several peptides, whilst the 
analysis of others may be slightly ambiguous, as identification relies upon a 
single peptide. In addition, since one out of seven proteins does not contain a 
cysteine residue any analysis will therefore not represent the whole proteome. 

Protein Function and the Proteome 

The analysis of proteome components by 2D gel electrophoresis and multi- 
dimensional chromatography, as described above, provides information about 
which proteins are expressed in a given cell or tissue type under a given set 
of circumstances. Although many of the identified proteins may already have 
a previously characterised function, there are examples where no function is 
known. All that can therefore be concluded about these proteins is that they are 
implicated in the function of the cell in the situation studied. 

There are two other types of approaches designed to study the interactions 
of specific proteins within a proteome. The first involves the analysis of mul- 
tiprotein complexes which are the functional units that are responsible for the 
majority of cellular processes. Identifying the protein components of these com- 
plexes may help us understand how these multiprotein units function within 
the proteome. A second approach for a functional study utilises protein arrays. 
Here, function is assigned by the identification of binding partners from sub- 
strates that are applied to ordered arrays of immobilised proteins, peptides, or 
other capture agents. 

Analysis of multiprotein complexes 

The study of pairwise protein interactions on a proteome-wide scale has been 
characterised in many ways. For example, Uetz et ai (2000) and Ito et ai (2001) 
have both adapted the yeast 'two-hybrid' assay into a high-throughput method 
for mapping binary protein interactions on a large scale. Investigating protein 
interactions that occur within a functional complex, of which the spliceosome, 
the proteasome and the nuclear pore complex are well-documented examples, 
is not so straightforward. First; the complex must be purified away from other 
protein components in a native from. The complex can then be separated by ID 
or 2D PAGE, or by a variety of chromatographic techniques, and the individual 
members subsequently identified by the high throughput mass spectrometry 
methods described above. 

The isolation of the complexes in the first instance is key to this process. This 
isolation process must be stringent enough to remove non-specifically bound pro- 
teins, but not so harsh as to strip off bona fide components. The most widely used 
method to isolate whole complexes is by immunoprecipitation. Here an antibody 
raised against a known component of a given complex is mixed with cell extract. 
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Application of this mixture to Protein A sepharose beads subsequently binds the 
antibody and, consequently, the known component and its associated partners, 
thus enabling their isolation from other proteins in the cell extract. Although 
this is a powerful technique, it has its drawbacks. These include the expense 
of generating the specific antibodies, the inability to isolate low copy number 
complexes, and also the complications caused by false positives and negatives. 

Recently, two groups have published slightly different but effective approaches 
for the identification of yeast complexes that comprise three or more components 
(Ho et al. 9 2002; Gavin, 2002). These studies purify multi-protein complexes away 
from cell extracts by using a tagged version of one of the protein components 
of the complex as 'bait' (Figure 17.5). Molecular cloning is used to attach tags 
to many different bait proteins. These .constructs are then introduced into yeast 
cells where they are expressed and form physiological protein complexes with 
their respective binding partners. After isolating these complexes by their tags, a 
combination of ID PAGE and standard mass-spectrometry methods were used to 
identify the individual protein components. Ho et al. (2002) constructed an initial 
set of 725 yeast bait proteins which were tagged with a FLAG epitope (FLAG 
tag Gateway™). A single immunoaffinity purification step using immobilised anti- 
FLAG subsequently enabled the identification of 3617 interactions involving 1578 
different proteins. 
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Figure 17.5 In the 'co-precipitation/mass spectrometry' approach used by Gavin et al. 
(2000) and Ho et al. (2002) an 'affinity tag' is first attached to the protein (the 'bait') (a), 
(b) Bait proteins are systematically precipitated, along with any associated proteins, on an 
'affinity column', (c) Purified protein complexes are resolved by ID SDS-PAGE. (d) Proteins 
are excised from the gef, digested with the trypsin, and analysed by mass spectrometry. A 
colour version of this figure appears in the colour plate section 
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Gavin et al (2002) based their proteome-wide scale analysis on. an approach 
initially described by Rigaut et al (1999). These workers used tandem affin- 
ity purification (TAP) tagging, where bait is coupled to two different tags in 
series to allow binding to either IgG or calmodulin. The advantage of a dual 
tag approach is that two purification steps are used for the isolation of com- 
plexes, thus enhancing the stringency and sensitivity of the system. Using this 
method, the authors identified 1440 distinct proteins within 232 multiprotein 
complexes in yeast. As 91% of these complexes contain at least one protein of 
previously unknown function, this study provides a wealth of new information 
on 231 previously uncharacterised yeast proteins, and on a further 113 proteins 
to which the authors ascribe a previously unknown cellular role. Furthermore, 
Gavin et al found that most of the : identified- complexes share common protein, 
components. This allows the assembly of a higher-order network of interactions 
that can be used to coordinate cellular functions. 

Although clearly powerful, this kind of approach does have its drawbacks. A 
significant number of false-positive interactions, as well as a failure to identify 
many known associations, were problems encountered by both groups. Gavin 
et al estimate that 30% of the interactions they detect may be spurious, as 
inferred from duplicate analyses of 13 purified complexes. 

Protein arrays 

An emerging technology to assign function to proteins involves the use of pro- 
tein arrays. This technology, which has many similarities with high throughput 
DNA chip methodology, involves the printing of ordered arrays of thousands 
of ligand molecules onto glass or silicon-based surfaces. Subsequent functional 
analysis using these arrays involves exposing the protein chips to a particular 
test sample, which might be a protein sample if protein-protein interactions 
are under investigation, or might contain various other potential substrates for 
binding such as DNA, metabolites, etc. 

Protein can be arrayed in several formats. Arrayed molecules can be whole 
or fragments of recombinant proteins, peptide molecules, either monoclonal, 
polyclonal, or specific domains of antibodies, and also fragments of a protein 
expressed by phage display. The production of stable protein arrays is not as 
straightforward as that of DNA arrays, however. One major problem is that the 
printing of small volumes of protein ligands onto a support can result in protein 
denaturation, thus compromising the biological activity of these proteins, and 
consequently, array function. Another hurdle is that ligand molecules ideally 
should be attached to the support in roughly the same orientation and also in a 
hydrophilic environment that will facilitate binding reactions. 

There are therefore many different methods for immobilising protein ligands 
to primarily glass or silicon surfaces, such that a stable functional protein array is 
generated. Some arrays are created by the non-covalent adsorption of proteins to 
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nitrocellulose-coated glass supports, which involves van der Waal's, hydropho- 
bic and hydrogen bonding interactions. The advantage of this method is that 
proteins require no form of modification for immobilisation. However, the ran- 
dom attachment, and therefore orientation, means that this method is more prone 
to steric occlusion effects, as well as to protein denaturation and inactivation. 

An alternative. method involves the covalent attachment of proteins, such as 
the treatment of glass surfaces with poly-lysine aldehyde to immobilise pro- 
teins via their primary amines (MacBeath and Schreiber, 2000). Other examples 
include the immobilisation of biotinylated proteins onto streptavidin-coated sur- 
faces or his-tagged proteins onto nickel-chelating surfaces (Zhu etal, 2001). 
The advantage of these types of attachment is that as well as the protein attach- 
ing in roughly the same orientation, . the amount of. non-specific protein binding 
upon application of the test sample is also reduced. The most common type of 
proteins employed in protein arrays are antibodies. With the advent of phage 
display antibody libraries, it is now relatively easy to obtain sufficient amounts 
of thousands of pure proteins with varying specificities (de Wildt and Mundy, 
2000). The alternative would be to express and purify large quantities of hun- 
dreds of different recombinant proteins for immobilisation. 

There are a number of ways in which proteins binding to particular positions 
on the array can be identified. For example, in order to detect the binding of the 
array to a particular protein of interest, antibodies specific to this protein that 
are conjugated to either an enzymatic or fluorescent label can be used. Another 
approach identifies the binding to the array of proteins that have a particular 
activity. These can subsequently be detected using enzymatically or fluorescently 
conjugated substrates. An elegant example of this latter application comes from 
Zhu et aL (2001) who cloned and over-expressed 5800 yeast proteins that were 
then spotted onto glass slides. This group then screened the slides for calmodulin 
and phospholipid binding and found many new examples of proteins exhibiting 
these functionalities. 

Another more general method employs the use of mass spectrometric tech- 
niques to look at proteins or other substrates bound at each position on the array. 
One application of this technique is known as SELDI (surface enhanced laser 
desorption ionisation) (Isaaq et al, 2002). Here proteins or substrates interact- 
ing with material bound at a certain position undergo laser desorption to form 
gaseous ions whose masses can then be analysed by the same mechanism as 
MALDI-TOF MS. Methods such as surface plasmon resonance (SPR) that 
employ the label-free detection of proteins are also now being used. Here, pro- 
teins are printed onto gold-coated glass slides which are then exposed to either 
the proteins of interest, cell lysates or drug candidates, and kinetic measurements 
of binding in real time can be obtained. 

Protein arrays are stalling to be widely used in the analysis of protein-protein 
and protein-drug interactions, as well as for expression } profiling studies of 
disease-related proteins. 
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17.4 Future directions 

The most common method to date of separating and identifying complex protein 
mixtures has been to employ 2D PAGE and mass spectrometry, respectively. 
This methodology is particularly useful for establishing which proteins in any 
particular cell are up- or down-regulated under particular physiological condi- 
tions which, for example, allow the identification of potential target proteins 
in diseased cells. It is anticipated that these methods will continue to be used 
routinely for the characterisation of protein expression levels and subsequent 
identification and that the throughput of samples will continue to increase. In 
addition, many researchers and instrument manufacturers are seeking ways to 
automate the identification and characterisation of post-translational modifica- 
tions of gel-excised proteins, which has until now been a difficult and laborious 
challenge, particularly for low abundant proteins. It is apparent, however, that 
only a modest proportion of proteins from the proteome are visualised on 2D 
gels because of problems associated with protein precipitation during the first 
dimension. As a result of this, it is likely that an increasing number of labs will 
invest in technology associated with multidimensional liquid chromatography 
that has been shown to be an effective method for the identification of both 
acidic and basic proteins. These techniques may be used in conjunction with 
labelling methods such as ICAT to enable high-resolution peptide separations, 
identification and relative quantitation. Furthermore, as the number of protein 
sequences that are found in databases increases, it is envisaged that there will be 
a marked shift from routine identification to the analysis of protein function of 
proteins within complexes. Encouraging results obtained from affinity purifica- 
tion/2D electrophoresis/mass spectrometry and protein array technologies have 
shown that the rapid analysis of protein -protein interactions is now feasible and 
potentially will have a large impact on biological science and drug discovery. 
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Gene Expression Analysis 
Using Microarrays 

Sophie E, Wildsmith and Fiona J, Spence 



13.1 Introduction 

Microarrays are of increasing interest to both industry and academia as tools 
for 'gene hunting' and also as quantitative methods for routine analysis of large 
numbers of genes. Techniques such as real-time polymerase chain reaction (RT- 
PCR) TaqMan™ and SybrMan™ are generally considered to be more accurate, 
robust, larger in dynamic range and less capital intensive, but for rapid, large- 
scale gene expression analysis using limited mRNA, microarrays and gene chips 
are preferred. 

13.2 Microarray experiments 
Platforms 

Global gene expression platforms are now available in multiple formats, 
including cDNA arrays, oligonucleotides spotted onto slides or in situ 
synthesised oligonucleotide arrays manufactured using photolithography. 
Commercial sources for these include Stratagene (La Jolla, CA), Memorec 
(Kdln, Germany) and BD Biosciences (Oxford, UK) for cDNA microarrays, 
Mergen Ltd (San Leandro, CA) for spotted oligomers and Affymetrix (Palo 
Alto, CA) for oligoarrays synthesised in situ. Purchasing from a supplier is more 
expensive than generating microarrays in-house, although the latter is beneficial 
in labour-intensive institutions or when proprietary gene information is utilised. 
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'Off-the-shelf microarrays may also have the advantage of rigorous quality 
control and standardised protocols. 

It is possible to produce oligonucleotide spotted arrays in-house, by design- 
ing oligonucleotide sequences that match genes of interest and then purchasing 
purified oligonucleotides to spot down on glass or other substrates. Alternatively 
there are new systems such as that available from CombiMatrix (Mukilteo, 
WA, USA) for computer-aided design and in situ synthesis of oligonucleotides. 
However, production of cDNA microarrays is currently the most affordable and 
popular method and is now well established. Numerous sources of informa- 
tion on cDNA microarray fabrication are available in the literature and on the 
internet (Bowtell, 1999; Cheung et al> 1999; Wildsmith and Elcock, 2001 and 
http://cmgm.stanford.edu/pbrovm). Thus, this- chapter will focus on the 
implementation of experiments and analysis of data from cDNA microarrays. 
The experimental procedure differs slightly according to the number of fluo- 
rophores (or channels) and the type and manufacturer of the array. We have 
attempted to describe a generic process, indicating where possible the different 
options. Figure 13.1 demonstrates the procedure for a two-colour hybridisation. 



Sample 1 Sample 2 



RNA isolation 




Figure 13.1 The microarray experimental process for two-colour hybridisations 
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RNA Extraction 

First, RNA is extracted from the tissue or cells of interest. The quality of the 
RNA extracted is paramount to the overall success of the microarray experi- 
ment, as impurities in the sample can effect both the probe labelling efficiency 
and also stability of the fluorescent label (Hegde et ai y 2000). Snap-freezing 
of tissue in liquid nitrogen, immediately after harvesting, is used to preserve 
RNA integrity. Any further sectioning of the tissues should be carried out under 
RNAse-free conditions (Fernandez et ai, 1997). Total RNA can be extracted 
using kits such as TRIzol® (Invitrogen, Paisley, Scotland) and Rneasy (Qia- 
gen, GmbH, Hilden, Germany). Some researchers perform a further extraction 
of mRNA; this results in a purer stalling material but, has the disadvantage of 
lower yields. Affymetrix recommend between 5 and 40 \xg of total RNA is 
required for their GeneChips™ and 10 \ig or less is the required amount of 
starting material for cDNA microarrays (Hegde et aL, 2000). 



Sample Labelling 

The mRNA is transcribed in vitro, with the concomitant inclusion of labelled 
nucleotides. The labels may be fluorescent or radioactive. In the case of dual 
channel/colour hybridisations, two samples will be labelled with dyes that 
fluoresce at different wavelengths, with different emission spectra. Example 
fluorophores, available coupled to nucleotides, are Cy3, Cy5, fluorescein 
and lissamine. Wildsmith et al (2001) have demonstrated that AlexaFluor 
546dUTP™ (Molecular Probes, Leiden, The Netherlands) gives a significantly 
higher signal than Cy3dCTP (Amersham Biosciences, Piscataway, NJ, USA). 
When performing two-colour hybridisations the control sample and 'test' sample 
are labelled with different fluorophores and the subsequent cDNA is then mixed 
together and hybridised simultaneously (Nuwaysir et al, 1999). An advantage of 
simultaneously hybridising control and treated sample is that it obviates the need 
to control for differences in hybridisation conditions or between microarrays. A 
specific example of the huge impact this technique has had includes its use 
in the first published account of gene expression data of the entire genome of 
Saccharomyces cerevisiae (DeRisi et a/., 1997). In two-colour hybridisations, 
one would assume that the properties of the two fluorescent dyes being used 
are equivocal. In fact, for Cy5 and Cy3 this is not the case as Cy5 has been 
reported to give higher background fluorescence and also is more sensitive to 
photobleaching than Cy3 (Van Hal et al, 2000). In addition, there is evidence 
from several independent sources that the combination of Cy3 and Cy5 dye 
labelling can affect data in certain genes. That is to say, when experiments 
are repeated and the dye combination for the two probes Reversed, inconsistent 
results are obtained with certain genes (Taniguchi et al, 2001). Despite these 
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Treated : Cy 3-532 nm Control ; Cy 5-635 nm 




Figure 13.2 Two-colour fluorescent scan of human gene cDNA array. The probe mix con- 
sists of DNA from HepG2 control cells and cells treated with buthionine sulfoximine for 6 h. 
A colour version of this figure appears in the colour plate section 

facts, two-colour hybridisations are widely accepted throughout the microarray 
community and an example of an image is shown in Figure 13.2. 

Hybridisation and Processing 

After labelling, the cDNA is purified (to remove unincorporated nucleotides), 
mixed with a hybridisation buffer and then applied to a cDNA microarray slide. 
The sample and the slide are heated prior to hybridisation in order to separate 
double-stranded DNA. A coverslip is applied (Shalon et aU 1996), or prefer- 
ably a hybridisation chamber is used to avoid evaporation and enable an even 
hybridisation. The hybridisation and subsequent wash steps are carried out at a 
buffer stringency and temperature that enables hybridisation of complementary 
strands of DNA but reduces non-specific binding. 

Image Capture and Image Analysis 

After hybridisation the microarray slides are scanned, using either a laser or a 
phosphorimager (depending on the type of label used). There are many different 
suppliers and models of fluorescence scanner, for example the ScanArray 5000 
(Perkin Elmer Life Sciences, Zaventem, Belgium), GenePix 4000B (Axon GRI, 
Essex, UK) and the Gene Array® (Affymetrix, Santa Clara, CA). The choice of 
scanner is determined by sensitivity, resolution, flexible wavelength, file size 
generated, throughput and technical support available. 

Images are analysed using software that measures the intensity of the sig- 
nal from the hybridised spotted genes (spots), which provides a measurement 
of the amount of cDNA bound. Thus the initial concentration of messenger 
RNA is inferred. Early software packages 'drew* grids around the spots and 
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integrated across the whole area of the grid. This overcame problems associ- 
ated with accurate location of the spots, which is problematic, especially if the 
spots on the printed arrays are poorly aligned. More recent versions of software 
'draw' circles around the spots themselves and perform measurements within 
and outside of this boundary. For example, the background may be calculated 
from a region outside the spot boundary. The intensity of the signal from the 
spot may be calculated using median, mode or mean values of the pixels within 
the spot. Researchers differ in their preferences regarding using median or mean 
values (Hegde et al. y 2000) and this is likely to depend upon the protocols and 
software used. 

Image analysis software commonly is supplied with scanners or can be 
bought from the same supplier. This has the advantage of being opti- 
mised for that specific type of microarray and the benefit of upgrades 
and technical support. Software for microarray analysis is available 
from BioDiscovery (http://www.biodiscovery.com), Imaging Research 
(http://imagingresearch.com), GenePix Pro (Amersham Biosciences, Pis- 
cataway, NJ), arraySCOUT 2.0 (http://www.lionbioscience.com), NIH 
(http : //www . nhgri . nih . gov/DIR/LCG/l5K/HTML/img .analysis , html), 
Stanford University (http://rana.stanford.EDU/software) Media Cyber- 
netics (Silver Spring, MD, USA) andTIGR (http// ww.tigr.org/softlab). 
Important criteria for image analysis software include speed, ease of use, 
automation and the ability to distinguish artefact from real signal (Wildsmith 
and Elcock, 2001). 

As the technology has evolved and more experience gained, it has become 
more and more apparent that the most significant issues facing microarray users 
are the processing of the vast quantities of data generated and deciding exactly 
what tools are the most appropriate for data analysis. Because of the enormity 
of this, we have dedicated a complete section to describing the current status of 
this area. 



13.3 Data analysis 

It is important to be cognisant of the fact that the practical laboratory aspects of 
using microarrays are only part of -gene expression analysis. Many researchers 
generate vast volumes of data, withbut a clear understanding of how to manage 
and interpret them. Furthermore, th& variability in microarray data confers addi- 
tional problems for analysis. In some'' cases the purpose of the experiment will 
be a gene-hunting exercise, in which case a cursory indication of potential gene 
biomarkers is sufficient analysis. In other instances, such as pathway mapping 
and screening studies, it is paramount that results are statistically meaningful 
and valid. The next few sections detail some relatively simple analysis meth- 
ods and recommendations for the benefit of researchers with minimal statistical' 
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Figure 13.3 The ideal microarray experimental design and process 

training. There are also suggestions for more advanced analysis for those who 
have the assistance of a statistician or specialist data analyst. 

Most of the steps before, during and after performing a microarray experi- 
ment are optimally conducted with regard for statistics and data analysis. Careful 
planning before implementation facilitates the downstream analysis and inter- 
pretation of data. The following model summarises the entire microarray process 
with integration of the biological and data analysis components (Figure 13.3). 



Hypothesis Generation 

Any study is conceived for the purpose of investigating or obtaining supporting 
evidence for a biological hypothesis. Giving time at this early stage to consider 
downstream implications will pay dividends later. It is helpful if, rather than 
simply stating the aims of the experiment, the researcher asks the question 'What 
results do I expect?' or 4 what answer will validate/invalidate my hypothesis?'. 
This 'reverse-engineering' proves useful in focusing the project, assessing the 
feasibility of the work, providing early preparation for data management and 
analysis and, importantly, in managing expectations with regard to outcomes. 

A good example of careful experimental planning is demonstrated by Golub 
et al (1999) in the classification of acute leukaemias in order to distinguish 
between acute lymphoblastic leukaemia (ALL) and acute myeloid^ leukaemia 
(AML). Distinguishing between ALL and AML using conventional' techniques 
is known to be a difficult task. The researchers maximised their probability of 
success by choosing an easier, more defined model (normal kidney vs. renal 
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cell carcinoma), on which to validate their analytical methods. In doing so they 
established that their techniques were suitable for classifying tissues according 
to disease and gained confidence in their approach before using the samples of 
real interest. 

Optimisation Experiments 

Although microarrays are becoming increasingly accessible to all, using these 
tools requires experience and it is unlikely that successful experiments will be 
conducted immediately. It is usual that some time is given to optimising a sys- 
tem for any specific application, for example for a given tissue or cell type. 
Additionally, the requirements for a given system, may warrant some modifica- 
tions. The standard approach for a scientist to take is to vary one parameter, 
whilst keeping all others constant. This is time-consuming and does not take into 
account the interactions between different factors. Well-designed, multifactorial 
experiments (Box et al, 1978), provide a faster route for optimisation, with a 
statistical measure of confidence. An example of this technique is in the opti- 
misation of microarray experimental conditions for preparation of fluorescent 
probes from rat liver tissue (Wildsmith et a/., 2001). When a major source of 
variation is revealed this can be investigated further with a view to minimising 
it or providing sufficient replicates to account for it. 

Design of Experiments 

Once confidence in the experimental procedure has been obtained the researcher 
is likely to have gained an insight into the reproducibility of the system. This 
assists in the design of the experiments, in particular in determining the mini- 
mal number of replicates necessary. Replication can be implemented at many 
stages - from biological samples through to microarray slides. 

Owing to the enzyme-catalysed transcription reactions, a large amount of 
valuation occurs during the probe-making stages in microarray experiments. Out- 
work indicated that replicates should be made at this step and a minimum of six 
replicate probes are made for microarray experiments (Wildsmith et ai, 2001). 
These can be pooled or hybridised separately onto six microarray slides. 

Lee et al (2000) have examined the effect of the different location of cDNA 
spots on the glass slides and concluded that replicates are essential to provide 
meaningful data and to enable reliable inferences to be drawn. 

With regard to commercially available gene chip systems, such as that avail- 
able from Affymetrix (see Figure 13.4), the variation between chips, within a 
batch, is likely to be low due to stringent quality control and highly automated 
manufacture. The use of an automated wash station also reduces variability in 
intensities between chips! However, using a multi-step approach in the probe 
preparation and subsequent antibody binding steps may lead to variation between 

* ' ■-. ■ > 

I 

\ 

i i » 



276 GENE EXPRESSION ANALYSIS USING MICROARRAYS 




Figure 13.4 The GeneChip® Instrument System. From left to right, the hybridisation sta- 
tion, scanner and workstation. Image courtesy of Affymetrix. A colour version of this figure 
appears in the colour plate section 



replicate samples prepared on different days. Pooling of reagents within an 
experiment, and analysing controls together with treated samples, will both 
reduce the variability within a given experiment. 

Conduct of Experiment 

At this stage some attention may be required for verifying and validating pro- 
cesses. For example, checking that the imaging instruments give consistent 
results across the slide, on repeat use and from day to day. If two imagers 
are used it is important to verify that the results from both machines are com- 
parable. Some laboratories read fluorescence of one channel and then adjust the 
laser intensity of the second channel in order to obtain comparable readings. 
This is a method of normalising for the difference in intensities of fluorophores. 
It is important to be aware that this approach has a number of drawbacks. The 
arbitrary value of the second laser intensity setting will vary from experiment 
to experiment; thus comparisons of this channel cannot be made across experi- 
ments. Also the response of the fluorophore may not be linear across the laser 
intensity settings and this can lead to additional errors. 

Another area for investigation prior to running the study itself is the image 
analysis component. Depending on the software used, the image analysis pack- 
age may process the data to some extent, for example automatic background 
subtraction. Full understanding of the software is required so that it is clear at 
what point the data are 'raw', and the extent of inherent, inseparable manip- 
ulation. Effort may be required to determine the optimum settings for any 
software parameters. 

As data are generated it is important to be aware of the data integrity - for 
example ensuring that all data are collected, so that there are no missing data that 
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Figure 13.5 The relative fluorescence units (RFUs) of 1248 genes on seven microar- 
ray slides that were hybridised with cDNA made from the liver of a rat treated with 
acetaminophen. Note the gene outliers at approximately gene number 480 

can complicate analysis later. The researcher may be intuitively aware of any 
spurious results and should be alert for anything extraordinary that could indi- 
cate problems, for example hybridisation intensities appearing inconsistent from 
sample to sample. Data analysis at this point can be a rapid indicator of dubious 
results. For example, Figure 13.5 shows a plot of the fluorescent intensities of 
1248 genes that were hybridised with probe derived from acetaminophen-treated 
rat liver tissue. The data appear consistent, with the exception of peaks in inten- 
sity on one slide at around gene 4800. Further investigation of the microarray 
revealed a large artefact that had been missed by the image analysis process 
(Figure 13.6). 



Raw Data Generation and Storage 

One issue that arises when carrying out microarray analyses is how much data 
to store and in what form. For Good Laboratory Practice (GLP) purposes, often 
required in industry, storage of the raw data is necessary. This could be construed 
as the microarray image. Storing the image analysis results requires far less 
storage space and is easier to visualise, but it has the drawback that image 
analysis cannot be> redone should superior software be available in the future. 
In reality, the methods used for microarrays are continually changing and the 
likelihood of revisiting old images on which the analysis has been performed, 
using outdated protocols is quite small. 
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Figure 13.6 Portion of scanned image showing region where artefact occurred that caused 
very high signals, which were classed as outliers 



Pre-processing 

A number of pre-processing steps are often used in microarray analysis. These 
include filtering, log transformation, normalisation and background subtraction. 
Filtering may be used before or after transformation in order to extract data 
from preferred regions of interest, or in order to remove outliers (see the 
above example relating to image analysis artefact). One example of filtering 
is the removal of individual gene replicates that lie outside a given number (for 
example 5) of standard deviations from the mean. Alternatively, data points that 
lie in the top/bottom few percentiles (e.g. 0.1 %) of the data can be removed. This 
method of removing outliers is also called 'trimming'. It is acceptable if there 
is a large volume of data where only a small proportion of data is removed and 
if the same method is applied consistently across all data. Care must be taken 
in the way in which this is carried out in order not to delete genuine data. For 
example, if one gene is consistently high or low in expression across replicates, 
then it is unlikely to be an outlier. 

Another method of detecting outliers is to plot all genes (see the section 
'Conduct of experiment' above) or to perform PCA analysis (see the section 
'Multivariate analysis' below) to detect replicate outliers. The use of a PCA 
plot to detect outliers is shown in Figure 13.7. 

Log transformation of data is accepted universally because the fluorescence 
data that are generated from microarrays tend to be skewed towards lower val- 
ues. There are scientifically valid reasons why ratios of raw expression values 
should not be used (Nadon and Shoemaker, 2002). When using two-colour 
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Figure 13.7 PC A plot of data used in Figure 13.4 showing one microarray (number 2) as 
an outlier 



hybridisations it is common to express the ratio of treated to control as a loga- 
rithm in base 2 (Quackenbush, 2001). Thus genes up-regulated by a factor of 2 
have a log 2 (ratio) of 1, and genes down-regulated by a factor of 2 have a log 2 
(ratio) of -1. 

Normalisation and background subtraction techniques are methods of data 
manipulation and their use is more subjective and often debated. The purpose of 
these techniques is to reduce the error (variability) that occurs between replicates 
and thus enable a comparison of data across samples. 

The theory behind background subtraction is that during hybridisation there 
will be non-specific binding to the slide. This will effectively 'darken' the 
image and give falsely high readings of fluorescent intensity. Correcting for 
the non-specific hybridisation should reduce error due to background staining. 
Background subtraction often occurs automatically in microarray image analy- 
sis packages. The software may circle the spot of interest and use the region 
beyond the periphery as the measurement of background. In cases of uneven 
hybridisation this method enables locally high background to be subtracted on 
a regional basis. One criticism of this approach is that the slide surface beyond 
the periphery is not similar, in chemical terms, to that where the nucleic acid 
has been deposited, and therefore capnot act as a real control for non-specific 
binding. A more accurate measurement of non-specific binding can be gained 
from using a region where spotting chemicals have been deposited, but no target 
is present. This concept is the basis of a method using local 'blank spots' (Wu 
et aU 2001). 

A number of methods exist for normalisation of data. These include norrnal- 
* ising to total signal or to a 'known' spot or gene, standardisation, or proprietary' 
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methods. Normalising to total signal is the simplest approach, whereby the gene 
intensity is expressed as a percentage or proportion of the signal intensity for the 
entire array. This method works best when the total intensities for the microar- 
rays are similar and the number of changes is small compared with the number 
of genes. However, we often find, when using arrays of around 1000 genes, that 
pathological disease can up-regulate a large number of genes simultaneously. In 
this case, when total signal normalisation is applied, highly up-regulated genes 
will appear less up-regulated and genes that do not change from the control will 
appear down-regulated. 

Normalisation to a control value is a more popular technique. A control value 
can be obtained from using a gene known to remain constant under the condi- 
tions of the experiment. DeRisi el aL (1997) used, a panel of 90 housekeeping 
genes for normalisation, but found considerable valuation in their gene expres- 
sion. Unfortunately it is very difficult to know with certainty that a gene will 
not change and there is evidence to suggest that so-called 'housekeeping genes' 
are variable (Savonet et al, 1997). Other control genes can be derived from an 
alternative species; these should not be expected to hybridise. We have used 
yeast and Arabidopsis genes as negative controls for hybridisations of rat tis- 
sues. No orthologs were known to the genes selected; however, in most cases 
non-specific binding occurred. 

If two or more microarray replicates appear to be different, but they are 
expected to be the same, then they can be standardised. An example of this 
might occur if the total intensity of one microarray is greater than another, but 
the genes are proportionally equally up- or down-regulated. If the microarray 
sample spot data are assumed to be drawn from a normal distribution, then the 'z- 
transform' can be used. This requires that the mean and the standard deviation of 
the intensity values for each microarray are determined. The mean is subtracted 
from each individual gene value and the remainder is divided by the standard 
deviation. The intensity values from each microarray will then have the same 
mean and standard deviation. This has the advantage of facilitating comparison 
of microairays with different dynamic ranges as well as total intensities. If the 
data are not normally distributed, then alternative non-parametric methods can 
be used, such as normalising to the median. 

Univariate Analysis 

Univariate methods of analysis involve examining one variable, or g$ne, at a 
time. This can be a very laborious task when examining a large volume of 
data, yet it is the preferred method of biologists. The simplest technique is to 
compare control and treated values and express the result as a 'fold-change' 
ratio. Typically, when examining small volumes of data, fold changes greater 
than 2 and less than 0.5 are considered meaningful (Quackenbush, 2001). This 
cut-off is essentially arbitrary and has the distinct drawback that microarray data 
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are not homoscedastic; that is, there is more variation about the mean at low 
values than there is at high values (Draghici, 2002). 

A second method for finding up- or down-regulated genes uses the standard 
deviation (SD) of the replicate gene data. Thus if changes greater than, say, 2 
SD from the log mean ratio are considerably greater than changes associated 
with 'noise', then they are considered significant. This technique means that 
when looking at a large number of genes that are normally distributed there will 
be up- and down-regulated genes, regardless of whether there are (biological) 
changes (Draghici, 2002). 

Rather than using arbitrary cut-off values it is far more meaningful to express 
the fold-change in terms of either confidence intervals, or a Rvalue' (that 
is, the probability of the value occurring by chance). Thus, a fold-change of 
1.1 may be associated with a p- value of 0.001 and thus the probability that 
the gene is not up-regulated is 1 in 1000. Naturally, such small fold-changes 
may then be queried in terms of biological significance. One must then ask 
the question: Are large fold-changes more important (biologically) than small 
ones? Simple calculation of p-values for two data sets can be obtained using 
/-test functions in standard spreadsheet software. A number of replicates are 
necessary for this approach, and the data must be normally distributed. We 
have recently developed a method for calculating /?-values for fold-changes that 
is not influenced by the distribution of the data or outliers and applied it to 
TaqMan™ and microarray data. Other complex and computationally intensive 
methods for calculating p-values are described in Draghici (2002) and Nadon 
and Shoemaker (2002). 

Multivariate Analysis 

Multivariate analysis of gene expression data is becoming increasingly popu- 
lar in the microarray community and in other biological domains where large 
volumes of data are generated. Multivariate analysis methods include princi- 
pal component analysis (PCA), factor analysis, multivariate analysis of variance 
(MANOVA) and cluster analysis. Currently, cluster analysis is the most widely- 
used method in the microarray community but PCA is growing in popularity 
(Crescenzi and Giuliani, 2001; Konu et al, 2001). 

Quackenbush,(2001) provides a good review of clustering tools that is rather 
unique in the regard that different clustering algorithms and linkage methods 
are presented. Clustering methods are unsupervised, and they are powerful tools 
for gaining insight into huge data sets. They enable the data to be partitioned 
in order to facilitate interpretation; however, they do suffer from subjectiv- 
ity. This is because the user selects various parameters, such as the algorithm 
used, linkage type, distance metric and, sometimes, cluster size. Whatever the 
data, clusters will always be identified, thus there is also a tendency to over- 
interpret the data - trying to attach meaning to clusters that may have no bio- 
logical relevance. 
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Software available for cluster analysis includes Cluster, the output of which 
is viewed in Treeview; both available from http://rana.stanford.edu/ 
software. This tool is particularly useful for clustering genes to identify genes 
that are co-regulated. 

The PCA is a visualisation tool that enables complex, high-dimensional data 
to be represented in two or three dimensions. It facilitates identification of groups 
of similar data, thus enabling inferences to be made about the samples. 

An example is shown in Figure 13.8. The figure shows the gene data for one 
sample (control rat liver) that was hybridised to seven microarrays according 
to the method used in Wildsmith et al (2001), Each microarray contained two 
replicate gene sets; thus there were 14 replicate gene sets in total. The gene sets 
comprised 1248 genes (with controls). All the data (14 x 1248 data points) was 
input into the analysis and the PCA plot displays the 14 replicates individually. 
The two axes are principal components 1 and 2. Principal component 1 (PCI) 
accounts for 65.5% of the variation in the data, whereas PC2 represents only 
13%. This means that the model accounts for 78.5% of variation in the data. 

The first principal component (PCI) accounts for as much as possible of 
the variation in the original data and subsequent components (e.g. PC2) are of 
decreasing importance. Thus, samples 8 and 9 are very different from samples 1 
and 2. In terms of interpreting the PCA plot, it is immediately clear that there are 
three or four distinct clusters of data. These are marked by circles. Datapoints 
tend to cluster in pairs; for example replicates 1 and 2, 3 and 4, 5 and 6, 
etc. These are the duplicate gene sets on the same microarray. This indicates 
that the variation within the microarray is lower than the variation between 




Figure 13.8 PCA plot of the microarray results from seven slides (2690, 2692, 2723, 2869, 
2876, 2879, 2880), each with two replicate spot sets (labelled 1-14), after hybridisation with 
control rat liver. See text for explanation 
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replicate microarrays. However, given that the same sample is applied to all 
the microarrays, we must ask why we get further separation of the replicates. 
The answer lies in the associated data. The four-digit numbers associated with 
the clusters are the microarray slide numbers that indicate when they were 
printed. Numbers that are more similar seem to be more closely related, and thus 
we can hypothesise that there were some differences between the slides, such 
as differences in slide backgrounds or changes during the printing process or 
change-over of batches, between slides in the 2600-2700 region and the 2800s. 

The PCA provides a clearer overview of the data than does cluster analy- 
sis. It is a rapid method for gaining an insight into the results, in particular 
where biological meaning can be attached to the components (Crescenzi et a/., 
2001). There are a number of packages for multivariate, data analysis, including 
SIMCA-P (Umetrics, AB, Umea, Sweden) and The Unscrambler (Camo ASA, 
Norway), both of which are useful for PCA. 

Other tools for data visualisation include software packages such as Spot- 
fire.net (Spotfire Inc., Cambridge, MA, USA) and GeneSpring (Silicon Graphics, 
San Carlos, CA, USA). Spotfire is particularly useful for visualisation of mul- 
tidimensional data and for visualisation of temporal data. It is possible to use 
these tools to identify genes that are co-ordinately expressed over time. 

Biological Interpretation 

After developing a sound experimental strategy, ensuring that the results are 
statistically valid, and after analysis of the data, it is down to the biologist to 
assemble the pieces of information that have been obtained. This intertwined 
information may include unexpected results that are contradictory to intuition or 
to published literature. One way to untangle the data is to map the relevant genes 
onto existing pathways and known functions. The Kyoto Encyclopedia of Genes 
and Genomes (KEGG), available at http://www.genome.ad.jp/kegg/, is a 
useful source of information, especially where the gene products are enzymes. It 
enables visualisation of the position of up- or down-regulated genes in metabolic 
pathways. 

The gene expression data obtained may differ from protein expression data, 
or information on gene product activity or location. When initiating a study 
it is useful to consider additional endpoints that can assist in the interpreta- 
tion of the data. For in vitro studies, these might include cytotoxicity endpoints, 
metabolites, key signalling molecules or perhaps protein expression. Waring 
et al. (2001) used tetrazolium dye reduction (MTT) as a measure of hepatocyte 
cell viability for their studies of gene expression in response to hepatotoxic 
insult. For in vivo Studies, expression information on the tissue. of interest could 
be supported by pathology, histology and blood chemistry measurements. Gene 
expression results could be confirmed by in situ hybridisations or protein activity 
assays. 
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13-4 Recent examples of microarray applications 

One area of rapid progress using microarray technology is the increased under- 
standing of cancer. Molecular pathologists are subgrouping cancers of tissues 
such as blood, skin and breast, based on differential gene expression patterns, 
for example,, within a small, group, of breast, cancer tissue samples, Perou el al. 
(2000) distinguished two broad subgroups representing those expressing or alter- 
natively lacking expression of the oestrogen receptor-a gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared 
with the previous methods of gene identification. 

Another example of the impact of this technology is in the identification of 
two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran 
et aU 2001). 

Microarray technology has also accelerated the understanding of the molecu- 
lar events surrounding pulmonary fibrosis. Specifically, two distinct clusters of 
genes associated with inflammation and fibrosis have been identified in a dis- 
ease where, for years, the pathogenesis and treatment have remained unknown 
(Katsuma et ai, 2001). 

13.5 Conclusions 

Important factors in gene expression experiments include sensitivity, precision 
and reproducibility in the measurement of specific mRNA sequences (Schmittgen 
et al. y 2000). These quality metrics can be maximised by using, or fabricat- 
ing, high-quality microarrays, and by optimising each step of the microarray 
process. From conception to conclusion it is important to bear in mind the 
original hypothesis. 

Having considered the complexity of the microarray experiment, the value 
obtained from a meticulously designed experiment should not be underesti- 
mated. As the number of high-quality gene expression studies increases, we hope 
that the literature will contain increasingly detailed information that will help 
interpret complex gene expression changes, and thus elucidate the mechanisms 
of disease. 
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INCREASING NUMBERS OF HUMAN Dis- 
eases, both acquired and genetic, are 
being considered to be based at least 
in part on alterations in DNA se- 
quence. For most diseases, inheritance 
and acquisition are likely to be com- 
plex and polygenic. The efforts of the 
Human Genome Project to elucidate the 
structural genetic background by iden- 
tifying the chromosomal positions and 
genomic organization of between ap- 
proximately 30000 and 35000 human 
genes are nearly complete. 1 Based on this 
structural knowledge, a byproduct 
should be a better "scaffolding" to help 
link specific genes to susceptibility to 
various human diseases. However, to un- 
derstand how the products of these ge- 
netic linkages work together to orches- 
trate the initiation and progression of 
particular complex diseases, there will 
be a need to apply a functional genetic 
rather than a structural genetic ap- 
proach. 2 ^ 

Until recently, functional genetic 
studies have generally been of limited 
scope, only able to elucidate the role of 
1 or a few genes at a time in 1 system. 
Information on the specificity and rela- 
tive abundance of expression prod- 
ucts has traditionally been obtained by 
techniques such as RNA Northern blot 
hybridization and ribonuclease protec- 
tion assays. Somewhat more sophisti- 
cated methods, such as differential dis- 
play 4 and Serial Analysis of Gene 



DNA microarrays represent a technological intersection between biology and 
computers that enables gene expression analysis in human tissues on a ge- 
nome-wide scale. This application can be expected to prove extremely valu- 
able for the study of the genetic basis of complex diseases. Despite the enor- 
mous promise of this revolutionary technology, there are several issues and 
possible pitfalls that may undermine the authority of the microarray plat- 
form. We discuss some of the conceptual, practical, statistical, and logisti- 
cal issues surrounding the use of microarrays for gene expression profiling. 
These issues include the imprecise definition of normal in expression com- 
parisons; the cellular and subcellular heterogeneity of the tissues being stud- 
ied; the difficulty in establishing the statistically valid comparability of ar- 
rays; the logistical logjam in analysis, presentation, and archiving of the vast 
quantities of data generated; and the need for confirmational studies that 
address the functional relevance of findings. Although several complicated 
issues must be resolved, the potential payoff remains large. 

JAMA. 2001;286:2280-2288 



Expression, 5 have been used to screen 
larger numbers of complementary DNA 
(cDNA) clones. However, technical 
limitations render these techniques 
nonconducive to large-scale genetic 
' survey. 

To this end, a powerful new tech- 
nology is emerging, using hybridiza- 
tion to nucleotide arrays, the so-called 
gene chips. 6,7 This technological inter- 
section of biology and computers en- 
ables the reliable screening of a vast 
number of genes simultaneously and is 
amenable to automation. On a nylon 
membrane or glass surface, gene- 
specific cDNAs can be spotted, or oli- 
gonucleotides can be synthesized in situ 
by a combination of photolithography 
and oligonucleotide chemistry. This 
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permits simultaneous monitoring of the 
expression of thousands of genes in a 
single step. Individual chips can be cus- 
tomized to include any chosen set of 
fully or partially characterized ge- 
nomic or expressed sequences. Chips 
can monitor over 50000 unique se- 
quences. The power of these chips lies 
in the potential for comparative expres- 
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sion studies in diseased vs normal 
samples, and in documenting changes 
at different stages during the natural 
course of the disease or in response to 
treatment. It provides the researcher 
with a new arsenal to analyze under- 
lying pathomechanisms on a grand scale 
and also to review the rationale of thera- 
peutic concepts. 

However, despite the enormous po- 
tential of this revolutionary technol- 
ogy, there are several issues and pos- 
sible pitfalls that attenuate the power 
of microarrays. First, the definition of 
normal in expression comparisons is 
neither precise nor unambiguous. Sec- 
ond, the heterogeneity of the tissues 
being studied complicates the mean- 
ing of the expression profiles. Third, the 
statistically valid comparability of ar- 
rays is an unresolved problem. Fourth, 
the vast quantities of data create a lo- 
gistical logjam for analysis, presenta- 
tion, and archiving. Finally, confirrna- 
tional studies are needed to corroborate 
the biological significance of microar- 
ray data (FIGURE 1). 

TROUBLE WITH NORMAL 

The standard normal vs diseased tis- 
sue type of comparison, which is the ba- 
sic design foundation of profiling stud- 
ies, may be more quicksand than 
bedrock. Normal is not so easy to de- 
fine — neither is diseased. Gene expres- 
sion in normal tissue is likely to be de- 
pendent on several factors involving 
patient and sample variation. These fac- 
tors will also have an impact on expres- 
sion profiles of diseased tissue. 

Patient Variation: Ethnicity, 
Sex, Age, Genetic Background, 
Disease States 

The ethnicity, sex, age, and genetic back- 
ground of a patient are likely to affect 
t he gene expression profiles of many tis- 
sues to varying extents. 8,9 A simple ex- 
ample is provided by the expression pro- 
files of genes involved in scalp and body 
hair follicle activity, which can be ex- 
pected to vary over a normal range un- 
der the influence of all of these sources 
of patient variation. The effects of these 
parameters on gene expression are likely 



Figure 1. Possible Pitfalls in Microarray Gene Expression Profiling Analysis 
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There are issues that complicate each step of microarray gene expression analysis. 



to be subtle but pervasive, not fully un- 
derstood at this time, and quite prob- 
lematic for defining norrnaL 

The presence of disease in a subject 
who is the source of tissue for control 
purposes, presents further potential 
variabilities. For example, there may be 
a significant difference in the conclu- 
sions reached by 2 similar microarray 
expression profiling studies. One may 
compare genes expressed in a pa- 
tient's diseased lung tissue with those 
expressed in normal, nondiseased lung 
tissue from the same patient, and an- 
other may compare genes expressed in 
the same patient's diseased lung tissue 
with those expressed in normal, non- 
diseased lung tissue from a healthy con- 
trol or normal individual. Moreover, it 
is also possible that seemingly unre- 
lated disease states may influence gene 
expression at distant sites. For in- 
stance, the presence of diabetes in 1 of 
2 renal cancer patients may compli- 
cate the direct comparison of renal 
tissues. 

Sample Variation: Proximity 
to Disease, Anatomic Location, 
and Developmental Range 

Yet another complication derives from 
the proximity of the normal tissue used 
as a control for the diseased tissue. Tis- 
sue adjacent to an area of disease may 
not be normal despite absence of evi- 
dence of disease clinically or under the 
light microscope. Normal-appearing tis- 
sue near a tumor could, for example, 
be genotypically altered or exhibit an 
altered gene expression profile. !0 ~ n 
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Moreover, factors such as the degree of 
disease-associated inflammation may 
have a significant impact on gene ex- 
pression profiles. Other bystander ef- 
fects, epiphenomena, orsecondary dis- 
ease processes could all play important 
roles in determining expression pro- 
files within these adjacent, so-called 
normal tissues. These factors must be 
considered in the choice of normal. 

The precise location within a par- 
ticular organ may be another impor- 
tant factor that affects gene expres- 
sion. 9 For example, just as location 
relative to the urethra may influence ex- 
pression profiles in the prostate," skin 
from the nose; back, and palm are cer- 
tain to have different expression pro- 
files as well, despite all being from the 
same organ. Thus, site and specific ana- 
tomic location must also be taken into 
account in a description of normal. 

It must also be kept in mind that the 
definition of normal actually repre- 
sents a dynamic state. H All tissues, 
which are composed of early and late- 
stage cells, have a normal developmen- 
tal range. For example, normal epithe- 
lium in prostatic ducts ranges from 
atrophic to resting to hyperplastic, and 
each has a unique pattern of gene ex- 
pression. 14 

A 3-dimensional analytic approach is 
a strategy that has been used to address 
some of these concerns about defining 
normal. Cole et al H used a 3-dimen- 
sional model to characterize the entire 
prostate gland in theirstudy of gene ex- 
pression profiles in prostate cancer. In 
this study, whole-mount prostactec- 
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tomy specimens were divided into trans- 
verse cross sections such that the entire 
prostate gland, including the complete 
spectrum of normal epithelium and tu- 
mor progression, was available for view- 
ing, microdissection, and microarray 
analysis. This method was used to de- 
termine the exact physical relationship 
of the normal ducts, premalignant le- 
sions, and tumors — thus obtaining an 
anatomic framework on which to over- 
lay gene expression data. This tech- 
nique offers several advantages over the 
normal vs tumor comparison. Previous 
studies had used normal epithelium in 
prostatic ducts as a baseline control 
against which to compare and contrast 
tumor gene expression profiles. 16 How- 
ever, the expression profile of this nor- 
mal epithelium is affected by proximity 
to tumor, location within the gland, and 
developmental state. H These factors can 
be better appreciated using a 3- dimen- 
sional approach. 

Disease-Related Variation 

Of course, many of the parameters that 
affect normal expression profiles (pa- 
tient ethnicity, age, sex, and genetic 
background, location within an or- 
gan, and developmental stage) will also 
affect disease expression profiles. 17 " 19 
Disease heterogeneity, including sub- 
type, activity, severity, stage, and pre- 
vious as well as current treatments, also 
may have a significant impact on gene 
expression. 20 25 

Categorizing and subgrouping pa- 
tients on entry into a study may be use- 
ful to control for as many of these fac- 
tors as possible. However, there may be 
problems surrounding attempts to de- 
fine microarray-based categorization on 
the basis of another imperfect categori- 
zation system, such as histology, as these 
groups are sometimes arbitrary or in- 
consistently designated. Nevertheless, de- 
termining whether gene expression pro- 
files correlate with existing clinical or 
histological categories can provide new 
insights into the meaning of these cat- 
egories as can new methods of classify- 
ing cancers or other diseases into spe- 
cific diagnostic categories based on their 
gene expression signatures. Several stud- 



ies have been able to establish expression- 
based criteria (class predictors) for pre- 
existing categories and then use these 
new criteria to categorize new cases (class 
prediction). 26 " 28 Global profiling may also 
allow the development of new classifi- 
cation systems based on gene expres- 
sion alone (class discovery). 29 - 30 . 

Thus, when possible, it will be of 
value to profile a range of normal and 
diseased cell populations from a num- 
ber of patients to distinguish between 
differences in expression that are rel- 
evant to the disease process and those 
reflecting the biological spectrum of the 
normal tissue or that have occurred for 
reasons unrelated to the disease. The 
significance of this distinction is fur- 
ther appreciated when taking into ac- 
count the vast quantity of data gener- 
ated from microarrays and the potential 
for confounding interpretation from the 
inclusion of differential expression un- 
related to disease processes. 

It is worth noting, however, that the 
issues of patient and sample variability 
are not unique to microarray experi- 
ments. In fact, microarray experiments, 
in contrast to classic single-gene experi- 
ments, may actually provide the tools for 
identifying this heterogeneity. For ex- 
ample, DNA microarrays have been used 
to explore physiological variation in gene 
expression on a genomic scale in 60 cell 
lines derived from diverse tumor tis- 
sues. 31 Cluster analysis allows the iden- 
tification of prominent features in gene 
expression patterns that appear to re- 
flect molecular signatures of the tissue 
from which the cells originated. 11 

HETEROGENEOUS CELL 
POPULATIONS 

A further complication encountered with 
expression profiles is that any given tis- 
sue is composed of several cell types, 
members of which are likely to be within 
a spectrum of dynamic functional states. 
For example, a simple punch biopsy of 
the skin may contain keratinocytes, me- 
lanocytes, Langerhans cells, Merkel cells, 
adipocytes, smooth muscle cells of ar- 
rector pili, striated muscle cells of the 
panniculus carnosus, blood cells includ- 
ing immune system cells, and cellular el- 



ements of blood vessels, nerves, hair fol- 
licles, sebaceous glands, and sweat 
glands. Moreover, cells from each of these 
populations will be at various stages of 
development and levels of activation, per- 
forming different functions and respond- 
ing to disease processes or treatments in 
different ways and to varying extents. The 
result is a highly heterogeneous sam- 
pling of cells, each expressing a unique 
set of genes. An expression profile gen- 
erated from a microarray study of the 
RKA in such a sample will thus repre- 
sent merely a snapshot of the genes ex- 
pressed by a plethora of cells at a mo- 
ment in time. Such extensive cellular 
heterogeneity complicates the ability to 
draw conclusions about specific pro- 
cesses occurring within a tissue speci- 
men. An illustrative example is pro- 
vided by Stanton et al, 32 who used 
microarrays to identify genes differen- 
tially expressed during myocardial in- 
farction. The expression profiles they 
studied represented transcripts from cell 
populations as diverse as immune sys- 
tem cells, which migrated to the infarct 
region and are responsible for the in- 
flammatory response, cardiac myocytes 
within the ischemic area undergoing 
apoptosis and necrosis, fibroblasts un- 
dergoing proliferation and participat- 
ing in the formation of scar tissue to re- 
place the infarct, and cardiac myocytes 
undergoing hypertrophy to compen- 
sate for the loss of cells in the infarct 
area." The issue of such cellular hetero- 
geneity was avoided by categorizing the 
differentially expressed genes into func- 
tional categories to look for patterns in- 
dicative of cardiac remodeling without 
attempting to attribute specific tran- 
scripts to specific cell types. For gene ex- 
pression studies involving samples with 
mixed cellular populations, further in- 
vestigation, such as with in situ messen- 
ger RNA (mRNA) hybridization, may be 
necessary to localize the transcripts be- 
fore conclusions can be drawn about the 
roles of specific genes in specific cell types 
during the disease process. 

Laser Capture Microdissection 

An ingenious but technically delicate 
approach to the study of complex bio- 
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logical samples has become possible 
with the development of laser capture 
microdissection (Figure 2).* This tech- 
nique allows for the rapid and accu- 
rate procurement of cells from spe- 
cific areas of tissue under direct 
microscopic visualization, and thus 
makes the molecular genetic analysis 
of defined populations in their native 
tissue environment possible. 35 Sgroi et 
aP 6 demonstrated the feasibility of com- 
bining laser capture microdissection 
with high-throughput cDNA arrays. 
They showed that in vivo subpopula- 
tions of malignant cells from multiple 
stages of breast cancer progression 
could be separated from nonmalig- 
nant populations, and their expres- 
sion profiles could subsequently be ana- 
lyzed using microarrays. 

The potential is powerful. Specimens 
could be separated into tissue layers; for 
example, separating a skin biopsy into 
epidermis, dermis, and hypoderrnis. Tis- 
sues could be further differentiated into 
specific structural components, such as 
dermis into blood vessels, adipose, ar- 
rector pili, and sebaceous glands. Struc- 
tures could be separated into defined cell 
types, such as blood vessels into endo- 
thelial cells, erythrocytes, and lympho- 
cytes. Cell types could even be sepa- 
rated into marker-defined subtypes, such 
as lymphocytes into CD4 and CD8 cells. 
Expression profiles from refined and de- 
fined structures and cell types likely 
would be extremely valuable in the study 
of disease. 

Potential aside, there are significant 
limitations to this technology at the pres- 
ent time. The standard protocols for fix- 
ing and embedding tissue samples from 
surgical resections were not designed 
to be compatible with microarray ex- 
periments, with or without laser cap- 
ture microdissection. Typically, tissue 
suspected of being important for diag- 
nosis and staging is processed through 
aldehyde-based fixatives, such as for- 
malin, which damage mRNA integ- 
rity. 37 If frozen tissue is available, mRNA 
can be recovered and studied from dis- 
sected cell populations. However, fro- 
zen tissue sections are technically dif- 
ficult to prepare, the histology is often 



Figure 2. Laser Capture Microdissection 
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A, Tissue sections are processed and placed on a microscope slide under a thin, transparent thermoplastic film, 
which is attached to a movable cap. Visualizing the tissue microscopically, a short-duration, focused pulse from 
an infrared carbon -dioxide laser is used to activate and melt the film to selectively adhere cells within targeted 
areas of interest. B, When the cap is lifted, the film,- with selected cells stilt bound, is removed from the tissue 
section for further processing to retrieve cellular materials (eg, DNA, RNA, proteins). 



severely compromised, and the tissue 
available may contain only a limited 
portion of the lesion. 14 

Moreover, the sample amounts gen- 
erated from laser capture microdissec- 
tion can be small, even as miniscule as 
a single cell. 38 Consequently, the yields 
of RNA are low. Arrays have a thresh- 
old for the quantity of molecular start- 
ing material: at least 5 to 15 ug for oli- 
gonucleotide arrays and between 2 and 
100 ug for cDNA arrays, depending on 
the manufacturer, the source of the 
RNA, and the use of signal amplifica- 
tion3' <0 Studies that have successfully 
integrated laser capture microdissec- 
tion with microarray technology have 
used samples of approximately 1 X 10 4 
to 1 X 1(P cells with 95% to 98% ho- 
mogeneity as determined by micro- 
scopic visualization. 36 ' 41 If needed, am- 
plification techniques may be used to 
generate sufficient genetic material for 
microarray hybridizations. 41 Laser cap- 
ture microdissection is an intriguing 
technology, but time will tell whether 
its potential is realized. 

Although some biological issues re- 
lated to gene expression may be com- 
plicated by the presence of heterog- 
eneous cell populations in studied 
samples, it is also true that some bio- 
logical conditions can be understood 
only in the context of these heterog- 
eneous cell populations. The nature of 



global gene expression experiments is 
to uncover differences between 2 bio- 
logical samples, including those differ- 
ences based on diverse cell popula- 
tions. For example, to appreciate a 
disease that is characterized by an in- 
flammatory infiltrate, it must be un- 
derstood that the inflammatory infil- 
trate is part of the disease and is part 
of the difference between diseased and 
nondiseased tissue. Thus, the isola- 
tion of specific cell populations for 
study is not necessarily required or even 
desirable in all instances. 

MAKING AAICROARRAYS " 
COMPARABLE 

Ideally, microarray experiments should 
be comparable both within and be- 
tween laboratory or manufacturing sys- 
tems, but obtaining consistent and com- 
parable data is a critical challenge for 
microarray-based expression analysis. 
Major sources for the observed vari- 
ability of microarray data include the 
normal physiological gene expression 
variations in different samples and the 
noise introduced in the microarray as- 
say process. 42 

Physiological Gene 
Expression Variation 

Inextricably linked to the issues of pa- 
tient and sample variability and tissue 
heterogeneity discussed above, is the 
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problem of normal gene expression 
variations and how to distinguish these 
variations from significant disease- 
associated changes. Few studies have 
systematically investigated physiologi- 
cal expression changes, but data from in 
situ hybridizations suggest that normal 
variance for many tightly regulated tis- 
sue-specific genes can be within 20% to 
30% * 2 However, there can be as much 
as 2- to 4-fold random fluctuations for 
many genes in yeast. 43,44 Affymetrix 
(Santa Clara, Calif) guidelines have sug- 
gested that for most of the "housekeep- 
ing" genes in human tissues, which are 
likely to be less tightly regulated, differ- 
ences ofless than 4-fold are probably not 
biologically significant. 45 Conse- 
quently, a significant portion of micro- 
array data variability for high- or me- 
dium-abundance mRNAs may simply be 
due to normal expression variations. Sev- 
eral previous studies have used arbi- 
trary 2-fold change criteria to define sig- 
nificant expression change. 46 However, 
the 2-fold threshold has been shown to 
be statistically invalid even for dupli- 
cate experiments. 46 In a recent study that 
used cDNA rnicroarrays to profile gene 
expression in samples of normal skin 
from breast-reduction surgeries, 71 of 
4400 genes were found to demonstrate 
variability in expression greater than 2 
SDs from the mean of each gene. 17 These 
included genes coding for transport pro- 
teins, gene transcription, cell-signaling 
proteins, and cell-surface proteins. Thus, 
physiological variation should be con- 
sidered in the analysis and interpreta- 
tion of microarray data. More stringent 
criteria for defining significant expres- 
sion change may be useful. 

Noise in the Articroarray 
Assay Process 

For the tightly regulated (mostly low 
abundance) mRNA species, inconsis- 
tencies introduced at any stage of the 
microarray-based assay process may 
play a major role in data variability. 42 
Due to the miniaturization and the large 
number of genes involved, it is diffi- 
cult to maintain consistent processing 
conditions for each sequence across 
multiple assays, and obtaining accu- 
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rate absolute signals is unlikely. 42 Noise 
may be introduced by slide heteroge- 
neities, printing irregularities (eg, 
pin-to-pin variations), and spotting 
volume fluctuations. 48 Some of the sys- 
tematic variations may be reduced by 
the inclusion of controls, but random 
fluctuation at various manufacturing 
stages cannot be completely con- 
trolled and can accumulate quickly in 
a complicated assay. 48 

In certain microarray systems, 2 
samples are competitively hybridized to 
1 array using different fluors for label- 
ing. In other systems, there is only single- 
sample hybridization. A 2-color system 
might be expected to be more reliable 
since variations in spot size or amount 
of cDNA probe on the chip should not 
affect the signal ra tio (both signals are de- 
rived from the same spot). However, this 
only holds true if signals are well above 
the background in both detection chan- 
nels. 42 In fact, the signal level for most 
of the tightly regulated genes will likely 
be close to the background level. 42 In ad- 
dition, background level on a slide can 
also vary significantly from spot to spot 
due to factors such as unevenness in slide 
surface properties, dust contamination, 
and incomplete washing, leading to high 
levels of signal variability for low- 
abundance mRNA species even in 
2-color systems. 42 

The high levels of variability of mi- 
croarray data also mean that subtle 
changes in experimental conditions 
may significandy alter the results, mak- 
ing it difficult for separate laborato- 
ries to compare experimental data. In 
addition, the lack of standard con- 
trols, the predominant use of relative 
signals (ratios), and the adoption of in- 
compatible data formats contribute to 
poor comparability between studies. 42 

Despite the hard-wired variability in- 
troduced by chip manufacturing con- 
ditions, most of the published studies 
to date using microarray-based expres- 
sion analysis include only limited num- 
bers of replicates. 49 In fact, many stud- 
ies conduct the experiment only once. 
Considering the potential sources of as- 
say variation, the need for sufficiently 
replicated studies is underscored. 49 

(Reprinted) ©2001 Amen 



Microarray Data Normalization 

Because of variability of microarray data 
for single sample arrays and for fur- 
ther analysis of 2-color system arrays, 
each must be brought into the same 
scale to compare 2 or more arrays. How 
to perform this normalization of gene 
expression levels across multiple ar- 
rays, thus removing systematic varia- 
tion between the arrays and rendering 
different experiments comparable, re- 
mains an issue that is not yet fully re- 
solved. 50 Many of the early microarray 
studies in the literature simply ig- 
nored this issue. A more statistically rig- 
orous approach is needed. 

One difficulty has been that leading 
microarray manufacturers have not 
published statistical error models for 
their products. Thus, users are un- 
clear how much to adjust data for varia- 
tions in spot intensity, hybridization 
patterns, and intensity measurement 
sensitivity. Software does exist to al- 
low for array-to-array comparisons by 
using a scaling factor to normalize gene 
expression patterns across arrays. How- 
ever, in general, these algorithms as- 
sume that intensity differences be- 
tween arrays are linearly related with 
a zero y-intercept. M This assumption al- 
lows software to trim the tails off dis- 
tributions of expression from differ- 
ent arrays at statistical cutoffs and then 
simply move the distributions along an 
axis to a common level to provide com- 
parisons. However, this linear relation- 
ship often does not hold true. 51 When 
the average expression level of 1 array 
is higher than that of a second array, a 
longer tail will be trimmed from the sec- 
ond array. Thus, a greater number of 
genes from the first array will be 
counted as being expressed because 
their expression level is above the sta- 
tistical cutoff point. In this case, the 2 
arrays cannot be considered compa- 
rable. 

Although bioinformatic software has 
recently been developed that offers 
more statistically robust normaliza- 
tion, the cost of these commercially 
available programs (combined with the 
already expensive rnicroarrays) has 
been prohibitive for many research- 
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ers. 50 Standardization of these pro- 
cesses awaits the development of im- 
proved methods of normalization 
leading to valid statistical models widely 
available to all researchers. 

To this end, Schadt et al 51 have de- 
veloped a standard nonlinear curve 
technique for normalizing the data in 
arrays that do not demonstrate a lin- 
ear relationship between data sets. This 
model performs well when the 2 
samples being compared demonstrate 
a low number of differentially ex- 
pressed genes. However, when expres- 
sion profiles of 2 samples vary to a 
greater extent, Schadt et al 51 recom- 
mend a rank-selection method. Using 
this method, genes expressed on an ar- 
ray are ranked from highest to lowest 
level of expression. Then, for the ar- 
ray expressing a greater number of 
genes, the genes with the lowest ex- 
pression levels are removed from the list 
until the 2 arrays list a comparable num- 
ber of expressed genes. This type of 
rank-selection method has gained sup- 
port from other groups, but it too has 
limitations. 50 Removing low- expres- 
sion level data points restricts the study 
to the more extreme and easily de- 
tected entities, a technique that blunts 
the genomic-scale potential of micro- 
array technology. 

Efforts continue to improve compa- 
rability between arrays. Jones 50 re- 
cently applied a statistical model to nor- 
malize spotted cDNA array data that 
takes into account not only the differ- 
ences in numbers of genes expressed be- 
tween arrays, but also the interarray 
variations in fluorescent dye intensity 
and mechanical error occurring in the 
printing process. Nevertheless, the is- 
sue of how to properly normalize ar- 
ray data has not been settled. Research- 
ers must continue to demand statistical 
rigor in their comparisons before they 
can believe the mathematical results of 
their data. 

LOGISTICAL LOG J AAA 

Microarrays deliver massive amounts of 
data on tens of thousands of genes. The 
result is an immense quantity of bio- 
logical information that must be ana- 



lyzed, presented, and archived in a 
meaningful way. 

Data Analysis 

In human studies, the number of hy- 
bridizations that can be performed for 
any set of experimental conditions is of- 
ten restricted by the limited number of 
obtainable tissue samples and by the ex- 
pense of arrays. Restricted numbers of 
hybridizations for each experiment 
hamper the ability to assess the bio- 
logical significance of variation within 
or between given sets of conditions. 
Thus, for the assessment of thousands 
of genes in a setting of limited hybrid- 
izations, the importance of reliable and 
sophisticated algorithms for data analy- 
sis becomes amplified.' 51 

A logical beginning is to examine the 
extremes, that' is, genes with signifi- 
cant differential expression in indi- 
vidual samples. For example, a com- 
parison of 2 samples can be visualized 
in the form of a simple bivariate scat- 
terplot in which the expression profile 
of 1 sample (x-axis) is plotted against 
that of the second sample (y-axis). The 
distribution pattern generally demon- 
strates that the expression ratios clus- 
ter around the line in which x is equal 
to y (indicating comparable levels) , with 
individual genes falling varying dis- 
tances from this line. Additional lines can 
be placed on the scatterplot to repre- 
sent various fold changes of expres- 
sion. Data points that fall above or 
below these lines represent genes ex- 
hibiting expression ratios greater or less 
than the specified fold change. Thus, one 
can begin by examining those genes that 
demonstrate a 10-fold or greater change 
in expression level. To expand the num- 
ber of genes under investigation, one can 
examine genes that demonstrate a 5-fold 
or greater change, or a 3-fold or greater 
change, and so forth. Many studies de- 
fine a 2-fold or greater change in ex- 
pression level to represent significant dif- 
ferential expression. The 2-fold 
threshold, however, as noted above, has 
been shown to be statistically invalid. 46 
Although this simple technique can be 
efficient and effective for focusing on ex- 
panding sets of differentially expressed 
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sequences, again, such an analysis does 
not take advantage of the full potential 
of genome-scale experiments to en- 
hance our comprehension of cellular bi- 
ology that would be provided by an in- 
clusive analysis of the entire repertoire 
of transcripts in a cell as it goes through 
a biological process. 52 A more holistic ap- 
proach, which allows the deciphering of 
patterns from the entire data set, is 
needed. 

Data Organization 
and Presentation 

Statistical algorithms can be applied to 
detect and extract patterns within pro- 
filing data. It is a basic assumption of 
many gene expression studies that 
knowledge of where and when a gene 
is expressed provides information about 
the function of the gene. Therefore, an 
important beginning is to organize 
genes on the basis of similarities in their 
expression profiles. 53 However, even 
this basic tenet deserves critical con- 
sideration. Similarity of gene expres- 
sion profile does not mandate similar- 
ity of function or mechanistic pathway, 
and it may be purely coincidental. Nev- 
ertheless, the idea of clustering genes 
on the basis of their expression pat- 
terns is well established and cluster 
analysis has become the most widely 
used statistical technique applied to. 
iarge-scale gene expression data. 52 

Although various cluster methods can 
usefully organize tables of gene expres- 
sion measurements, the resulting or- 
dered but still massive collection of num- 
bers remains difficult to assimilate. Thus, 
another important component of ge- 
nome-wide expression data explora- 
tion is the development of powerful data 
visualization methods and tools. Ap- 
proaches have been developed that pre- 
sent clustering results in simple graphi- 
cal displays such as dendrograms, which 
represent relationships among genes by 
a tree whose branch lengths reflect the 
degree of similarity in expression be- 
tween the genes. Similarity is mathemati- 
cally defined. 54 The computed trees can 
be used to order genes in the original data 
table such that genes or groups of genes 
with similar expression level patterns are 
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Figure 3. Real-Time Polymerase Chain 
Reaction (PCR) 
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A, Complementary DNA synthesis from source mes- 
senger or total RNA proceeds as with traditional re- 
verse transcriptase PCR. B, A dual-labeled fluoro- 
genic probe (with a higher melting point than the PCR 
primers used for extension) is annealed to the target 
sequence between the forward and reverse PCR prim- 
ers. A reporter (R) fluorochrome (usually 6-carboxy- 
fluoroscein) is attached to the 5' end of the probe and 
a quencher (Q) molecule (usually 6-carboxy- 
tetrametbyl-rhodamine) is attached to the 3' end. 
C, As Taq polymerase extends in the 5' to 3' direc- 
tion, the dual-labeled probe begins to be displaced from 
the target sequence. D, As the Taq polymerase con- 
tinues to extend, the 5' to 3' endonuclease activity 
cleaves the reporter (R) molecule from the probe se- 
quence such that its emission spectra (518 nm) are no 
longer quenched by the second (Q) fluorescent dye. 
Fluorescence is measured continuously throughout the 
PCR amplification in real time and is proportional to 
the amount of PCR product generated in each cycle. 



placed adjacently. Clustering methods 
can also be combined with representa- 
tion of each data point with a color that 
quantitatively and qualitatively reflects 
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the original experimental observa- 
tions. 52 Visual assimilation is then more 
intuitive. 

Data Archiving and Mining 

Ultimately, successful interpretation of 
gene profiling studies is likely to be de- 
pendent on the integration of experi- 
mental data with external information 
resources. As multiple experiments in- 
volving multiple cell types and tissues 
from multiple laboratory groups accu- 
mulate, data archiving may well be- 
come the watershed issue. Ideally, all 
data, in a suitably standardized form, 
would be freely accessible in the pub- 
lic domain. Even assuming a willing- 
ness to share the data, such Utopian 
goals would require a user-friendly and 
powerful database system and stan- 
dardization of correction and normal- 
ization procedures such that data points 
from various projects become compa- 
rable." The National Center for Bio- 
technology Information Entrez sys- 
tem (http://www.ncbi.nlm.nih.gov 
/Entrez/) does provide useful data in this 
regard, but current databases may be 
limited in scope or computability." A 
major focus of infrastructure develop- 
ment to support genomic-scale expres- 
sion studies will need to be in the area 
of electronic biological pathway data- 
bases and resources. 

CONF1RMATIONAL STUDIES 

The development of more sophisti- 
cated analytical algorithms and data- 
bases will help lend credence to the bio- 
logical significance of differential gene 
expression determined by microarray 
analysis. In the meantime, several stud- 
ies have begun to examine the sensi- 
tivity and specificity of microarray- - 
based experiments. Sensitivity, defined 
as the minimum reproducible signal de- 
tected by a given array scanning sys- 
tem, has been reported for microar- 
rays to be approximately 10 mRNA 
copies per cell, which is slightly infe- 
rior to the sensitivity of Northern blot 
analyses. 5 ^ 7 Specificity studies showed 
that for a given probe any non target 
transcripts with more than 75% se- 
quence similarity may show cross- 

(Reprinted) ©2001 Ameri< 



hybridization. 56 The problem of clone 
misidentification and the need for clone 
confirmation have also been ad- 
dressed. 58 One study found that of 1189 
bacterial stock cultures, only 62.2% 
were uncontaminated and contained 
cDNA inserts that had significant se- 
quence identity with published data for 
the ordered clones. 59 Thus, the use of 
sequence-verified clones for cDNA mi- 
croarray construction is warranted. 

Additionally, potential gene candi- 
dates can be assessed for relevance to 
disease using parallel technologies. Sev- 
eral such alternative platforms have 
been used to bolster the importance of 
specific sequences first suggested in 
gene chip comparisons including (1) 
methods at the RNA level, (2) meth- 
ods at the protein level, and (3) func- 
tional studies. 

Methods at the RNA Level 

Reverse transcriptase polymerase chain 
reaction (RT-PCR) is a method often 
used to verify microarray data. Al- 
though RT-PCR is not well suited to 
quantitation, the relative technologi- 
cal ease of this assay and the ability to 
rapidly monitor multiple samples make 
it a useful technology. 60,61 Hybridiza- 
tion data can be verified and multiple 
putative markers can be screened in a 
short period. 

Several other studies have used real- 
time quantitative RT-PCR (TaqMan, 
PE Applied Biosystems, Foster City, 
Calif). n - 62 Real-time PCR is a tech- 
nique that increases the quantitative 
ability of RT-PCR by providing accu- 
rate and reproducible information 
on RNA copy number (Figure 3). In 
this method, a fluorogenic probe 
(labeled at the 5' end with a reporter 
fluorochrome and at the 3' end 
with a quencher fluorochrome) is 
annealed to 1 strand of the target 
cDNA sequence between the forward 
and reverse PCR primers. As Taq poly- 
merase extends the forward primer, its 
intrinsic 5' to 3' nuclease activity dis- 
places and degrades the dual-labeled 
probe, releasing the reporter fluoro- 
chrome from the quencher label and 
allowing the detection of a fluorescent 
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signal that is proportional to the 
amount of PCR product generated in 
each cycle. 63 

Northern blot analysis is also com- 
monly used as a confirmational tech- 
nique, as it is a standard specific and 
semiquantitative method. 15 - 57,61 For 
mRNA expressed at moderate-to-high 
levels, and for which cDNA probes are 
available, Northern blot analysis works 
well, but it is not well suited for low- 
copy mRNA. 64,65 Furthermore, only a 
small number of genes can be ana- 
lyzed with this conventional method. 

Methods at the Protein Level 

DNA microarray technology is limited to 
the study of gene expression at the 
mRNA level. However, it has been es- 
tablished that mRNA levels do not nec- 
essarily correlate with protein levels. 
Moreover, the level of expression or even 
presence of a protein is not ughdy linked 
to physiological consequences. An in- 
vestigation conducted by Winzeler et al, 66 
for example, provides a cautionary tale. 
Their study demonstrated that genes up- 
regulated in yeast growing in minimal 
medium did not prove to be more im- 
portant for growth than genes that were 
not upregulated. 33 They found only 2 of 
8 genes required for yeast growth in 
minimal medium to be induced. The les- 
son to be learned is that genes that are 
not differentially expressed may be of 
equal functional importance in disease 
states compared with those that are. 

Furthermore, the regulation of some 
genes maybe at the translational rather 
than the transcriptional level, which 
would preclude detection by DNA mi- 
croarrays. Posttranslational modifica- 
tion of proteins is also an important 
mode of regulation that cannot be de- 
tected by DNA microarrays. Protein ac- 
tivity, particularly receptor activity, is 
heavily dependent on phosphoryla- 
tion, for example. DNA and mRNA re- 
veal nothing about whether a given pro- 
tein is active, and can be deceptive when 
used to speculate about quantities of 
proteins. It has been demonstrated that 
the correlation between mRNA and pro- 
tein abundance isdess than 0.5, 67 em- 
phasizing that ideally, mRNA expres- 

©2001 American tyedical Association. All rig 



sion studies should be accompanied by 
analyses at the protein level. 39 Radio- 
immunoassay and immunohisto chem- 
istry have been used in a number of 
studies. 156869 These techniques, how- 
ever, are not well suited to detecting low 
levels of expression, and they require 
the availability of an antibody specific 
for the protein to be studied. 

The field of proteomics, the large^ 
scale parallel analysis of the proteins that 
are present in a cell, is developing rap- 
idly, but has problems of its own. Pro- 
teins vary in abundance by many orders 
of magnitude within a given cell, and 
there is no PCR equivalent for the ampli- 
fication of proteins. Moreover, proteins 
fold in many known (and unknown) 
ways that affect their function. The fea- 
sibility of the microarray analysis of pro- 
teins has begun to be explored. Antibod- 
ies attached to microarrays can be used 
to bind to and quantitatively detect pro- 
teins that have been tagged with fluo- 
rescent dyes. 70 Skeptics doubt the plau- 
sibility of identifying thousands of 
unknown proteins in this manner. 70 The 
diverse chemistry of various proteins 
poses serious difficulties, and it will be 
challenging to find antibodies for every 
protein. Thus, although it is important 
to incorporate protein analyses into 
expression profiling studies, current plat- 
forms are technically limiting. 

Functional Studies 

Confirming the role of a gene initially 
identified in a microarray experiment in 
animal models with transgene or knock- 
out studies provides a particularly pow- 
erful alternative platform. Transcript 
function, rather than mere presence, is 
addressed. However, this approach is ill- 
suited for high-throughput conditions. 
It may be ideal foi* an in-depth investi- 
gation of 1 or 2 genes of interest, but it 
is not practical for confirming large quan- 
tities of profiling data. 

Confirmational studies are useful to 
corroborate the biological signifi- 
cance of differential gene expression de- 
termined by microarray analysis. While 
improved databases and more reliable 
statistical models will help to lend 
greater authority to array data, alter- 
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native platforms can be used to assess 
the relevance of genes first identified by 
array comparisons. It should be real- 
ized, however, that the alternative tech- 
nologies are not intended for large- 
scale analyses. Realistically, only 
selected sequences from the array data 
can be confirmed with other plat- 
forms in the short-term, a retreat from 
the initial purpose of the genome- 
scale investigation by microarray. 

CONCLUSIONS 

Microarrays can be expected to prove ex- 
tremely valuable as tools for the study of 
the gene tic basis of complex diseases. The 
ability to measure expression profiles 
across entire genomes provides a level of 
information not previously attainable. Al- 
though complicated issues must be re- 
solved, the potential payoff is big. Mi- 
croarrays make it possible to investigate 
differential gene expression in normal vs 
diseased tissue, in treated vs nontreated 
tissue, and in different stages during the 
natural course of a disease, all on a ge- 
nomic scale. Gene expression profiles 
may help to unlock the molecular basis 
of phenotype, response to treatment, and 
heterogeneity of disease. They may also 
help to define patterns of expression that 
will aid in diagnosis as well as define sus- 
ceptibility loci that may lead to the iden- 
tification of individuals at risk. Finally, 
as specific genes are identified and their 
functional roles in the development and 
course of disease are characterized, new 
targets for therapy should be identified. 

Despite the problems of defining nor- 
mal, understanding tissue heterogene- 
ity, making arrays comparable, analyz- 
ing and archiving massive quantities of 
data, and performing confirmational 
studies in alternative platforms, expres- 
sion profiling with microarrays stands 
as a truly revolutionary technology. As 
we continue to delve into the possibili- 
ties, we will surely progress in our un- 
derstanding of current issues and com- 
plications. No doubt the ride on the 
high-throughput highway will be ex- 
hilarating. 

Author Contribution: Ms King and Dr Sinha both con- 
tributed to the conceptualization and writing of this 
article. 
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eppUcuions of PNA miuuaJiays include uncov er ing unsuspected associations between 
genes nod specific clinical feature of disease the! are helping devise novel moleculir based 
disease clarifications. Most published tumor studies using DKA mieraanoyi have either 
examined a pathologically homogeneous eel of tumors to Identify clinically relevant 
subtypes, for example survivors vs non-survivors, or pathologically' distinct subtypes 
btlongjog to the same Imcoge. for ex ample liarited stage vs advanced stage tumors to 
identify molecular correfatea, or rumors of difTeitni lineages to identify molecular 
dgnemres for each lineage. 

One of the landmark studies that have attracted much Interest with respect to the 
potential contribution of DMA injcroarroys to uncover novel clasaaa of rumors, la an 
analysis of diffuse large oVcell lymphoma* the most common subtype of non-Hotfgaint 
lymphoma 14 . Large B-cell lyoiphoma ia a clioicalry hetesogeneous disease. Onfy 4M of 
patients have a good response to current therapy wttfi a prolonged survival. A sysmrnxtfc 
characterisation of gene expression in this disease using DMA uuuuai i aya unc ov er ed a 
- diversity in gene expression tat reflected variation in tamer proliferation rate, host 
response and dtfferc miction state of the tumor. Two mateuhuty dbtinct forms of dUresc 
large B-cell lymphoma were oawvutd which had gene expiuaxm psmma mdtesdve of 
di ff trail stages of B-ccQ differentiation. One type expressed genes characteristic of 
germinal center B cells and bad a ds?tiftcantry better overall survival than the accond type, 
which caomsed genet normally induced during la vitro activation of peripheral blood B 
calls. Tho analysis thaeJui e identified previously undeteottd and cumcaOy ■Ig nlflcam 
subtype* of lymphoma. 

Stato to classify bnast eaaritu^ 
tumors could be classified Into a basal ephheilai-liloc group, an CRBB2-overaayreasiag 
group and a normal brcast-tikc group 11 * A hmdnal epihefiaVaOKrgen recepftY-pcolOve 
group could be divided into at least two eabgroups, each with o distinctive expression 
prafUsv Survival analyses on a eubcehort of patients with locuOy*edvenced breast eanoer 
uniformly treated, in ■ piospective 'study, showed daniftoandy oWcrcnl outcomca for 
patient* belonging to the various groups, including a poor prognosis for die basal-like 
subtype and o significant difference m outcome for the two estrogen noccptor-posJovc 
groups. In an iaefepenaem study of 3d invasive breast canters* suiting molecular 
differences between ductal carcinoma specimens were uncovered that led to a cojjestad 
new classification for esuogen-feceptar negative breast csncer". Similarly, a amdy of 54 
node-eegstive breast carcinomas discordant for BR atatua also uncovered a list of ajenes 
which cHscrunJnaied tumora aouuding to ER status 21 . Artificial ntursl net w oAa could 
accurately predict ER suttua even afuer exctoelng top discriminator genes, tocfoelng BR 
itself. Only a small proportion of the 100 most uiiu o rtant BR discrinunamr genes are 
regulated by csoadJol In MCF-7 cells. 

An Informative approach to analyze DNA microarray data in clinical surfs* is to divide 
such data into a oaining set to uncover a ss o ci a tions between specific genes and certain 
cBroeal features of the disease, and ■ testing act to validate these associations. However 
since both oiinang and testing acta are derived from dm name pool of patterns whose 
eamples were available to the.iirvestyelom the extent to which such association* may 
apply to other patients not included In the study, who may have different characteristics, 
cannot be inferred. Beer et of. have undertaken a study of lung eanoer in which the 
association they observed between a set of genes and patient survival was validated with o 
testing set of rumors titey had available and further validated with an independent set of 
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tumors for which micros/ray cfcia was collected by anorher group no: associated with (hat 
study' 9 . Such es tensive validation clearly indicated the robustness of the sssociatfoa 
uncovered bttweco o act of genes and survival In lung eJtaocarcinoina. 

The mmuroos published studies using DNA microarrsys justify the ose of this 
technology for uncovering patterns ■ of - gene esprtssion Chat ens clinically informative 
However if la substantially tnore difficult to develop an understanding of disease at o 
mechanistic level using DNA aiuwu i eya . For booh of the published studies it is uncle* 
how well UNA levels reported eorrehse with protein levels. A leek of ctmelanon may 
tnply that she predictive property of the genets) It htdependesi of gene Junction. In studies 
of lung cancer, Chen ct a), ccBecGrf both DNA mtcroarray and 2-D PAOB data* which 
allowed them to compare mRNA and protein levels in the same tumor*". The integrated 
mtensities of ICS protein spate representing protein produces of 9$ genes were analysed in 
76 lung adenocarcinomas and 9 unafTected lung tissues using 2-D gels. Far the same M 
samples, mRNA levels were cfctejrnined using oUs^muclcodde cnicroarrays. Oary 21 of 98 
genes (21.4%) had a sutisticalry significant correlation between .protein and mRNA levels 
(r >0J445; ? <D.05). Ine coRNA^proteio correiabon coefficients also varied between 
ttofosira of the same protein, indicating potentially isofesrn-specific coechaniams for the 
regulation of protein abradant*. 
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Deapita the advances In our <nttfsf*tan£ng of tt» in substantial 
gaps remain bath In our undemanding of cancer paJhapaasss and in the development of 
effective atncjegjaa for early diagnosis and tar Ottoman. A proteosnie approach to 
investigating diseasas such as caneef may overcome tome of the fenitationa of other 
approachssV DNA mferoarraya have Um$ted utiKry for the analysis of biological fluids and 
for uncovering directly in the fluid, assayable Womaitov, tfcraefous aJteraunne may occur 
in proteins that am not reflected in changes at Che JftNA level. 

Unlike DNA rntesosrrays that provide one measure of gene expression, namely UNA 
levels, there is s need to urtplement protein nteroarrey strategies that address the many 
different features of proteins iacluding determination of their tevala in Wologjcei samples. 



obtained using cDNA cacraskm libraries** 0 or pnage-dUplay libraries 41 have been 
utilized for different types erf protein based assays. With other types of imooarraya, whole 
titroe^denved samples have been directly arrayed onto slides, to assess the reactivity of 
total protein lysates with specific Ke^nd** - *. Two practical applications of protein 
microarrsys were ptssented by KiatsnVi**. designated protein function array and pretehv 
detecting array. With protein function arrays, a large amount of protein ts spotted on a solid 
support at a defined location and tested to characterise either a bicchemtca) activity or a 
molecular interaction. Ine pratdn<4efecting array consists of an arrayed set of protein 
ligends used to profile gene eapnasion and draw aignsturcs toeScsdvc of the celhifar etna. 
The whole process of assembling the protein array isqolres considerations rcf ated as the 
nature of the support the typo of immobilisation, as well as the molecular architecture of 
the particle being attached. Ann main different supports have been optimised to perform 
assays of Uds magnitude: chemically modified glass slides (poryrUlysines. pcJyeldtfcidBS. 
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these samples. Patterns that distinguish between cancel patients nod normal subject* with 
remarkable accuracy, have been reported for several types of cancer*. The coopiing of 
protein arrays with mass specoometry technologies is Ukely to become a powerful analytic 
tool with which to profile protein expression. Such an approach, known as rVotirhjOrfo 
(Gpheigsi Inc. USA), was snceessfuny applied to srep> prostate end ovarian cmtnT 
end, more recently, head end neck cancers*. Results from these studies revealed the 
involvement of proteins in carcinogenesis processes and specifically identified protein 
fingerprints from which cancer biomerhcr? were cavaeted The major drowbocks of direct 
analysis of tissues or biological fluids by MALOI is the prsjfensntiel detection of proteins 
whh a lower molecular mass and the difTicolty In (derm* rying. the protehis corresponding to 
the masses observed. Anther mehnologjcal imjnwemenis cooM enhance the mlHty of 
f analysis of tissues and biological fluids. 




2003). 



.potman* for opdmtastiisn ef POA fttpretectioa. (Rcpnadimd hem ftlfeis ft of.. 



.tissue profiling studies that have utilised protein fDtooairays are beginning to 
: As • model to better imderstand how patterns of protein expression s hape the tissue 
nri cr o ciivti onrnent. Kheievtc el el. analyzed protein expression i n ttos nc derived from 
sqoammta cell cercinomas of the oral cavity through en entibody rmtroarmy approach for 
mgh-throoghput proteomic Morysis 49 . Utilising laser capmre mlerodissectitm to encore 
total protein from specific rnjeroscopie cellular populations, they demonstrated that 
quantitative, end potentially qualitative, differences in capictai o n pafiera* of multiple 
proteins within epithelial cells lepTodwlWy correlated with oral cavity tumor progression. 
Differentia) expscaslon of multiple protems was found m stromal cells surrounding end 
adjacent to regions of diseased epIiheHnm that directly correlated with tumor progression of 
the epithelium. Most of the proteins identified in both cell types were Involved In signal 
trensduetion pathways. They hypothesised therefore that csterjsivo noleculer 
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comnfuntcaitons involving complex cellular signaling between epithelium and stroma ph>y 
• key rah in rtwrtfig oral cavity cancer progression. 

A clinically relevant sppfjcau'en of protein mtcroamyt is the identification of proteins 
(hat induce an antibody response in autoimmune disorders 0 . Mieroarnys were pi ed uce d 
by attaching several hunted proteins and peptides to the surface of derividxed glass stales. 
Arrays were iocubaied with patient serum, and fluorescent labels were used to detect 
autoantibody binding to specific proteins in svtotamtme diseases* including systemic lupus 
erythematosus and rheumatoid arthritis. Such i itfirutu i sy t represent a powerful tool to 
study immune responses, m a variety of diseases including cancer. 

A reverse phase protein array approach that unmobiUzcs the whole repertoire of a 
tissue's proteins has been developed^ A htfrfe degree of sensitivity, precision and tinearity 
was achieved* making it possible to quantify the ptamfcerylsied status of signal proteins in 
human tissue ecD subpopolaoons. Using this approach Ftwtltnret a) ."have IrmgimdrrmJIy 
' analysed the stale of pro*somval chcchpoint protein* at the stueroscopte tnumtioo stage 
from patient tnstched bisbslogiceJry normal prostate epithelium to prostate imioephhshsl 
neoplasia and to invasive prostate cancer. Cancer progression was associated widfc mcreaaad 
phosphorylation of Aid, su ppr e ss ion of apoptosis pith ways, as vail os decreased 
phosphorylation of EMC At the transition from histologically normal epithefiom to 
Intraepithelial nroplaiia, o statistically significant targe to phospnosytattd Akt and a 
co n comitant suppression of downsttessn apoptosis pathways prec e ding the trans itio n into 
invasive carciooma were, observed. 

A major challenge In making bxochfps for global analysts of protein eapressten is iho 
current lack of comprchciarve sets of genome scale capture agents such as antibodies. Aa a 
result, blochipi Out target specific classes of proteins such as looms or cytokhtea are much 
easier to produce, dial would hove clinical utility. Another important consideration to 
protein oucfoarrays is that proteins undergo numerous poat*trsnslatienal esocHfteetions eg 
phosphorylations, glyxosylattons, which are highly important to their roitetions, as they can 
determine activity. itBbfliry, tocalssatsnci and turnover. To eddress the need for 
comprehensive analysis of proteins In then* modified forme* several approathea to fho Heboid 
based eepsration of cell and tissue fyssses were Investigated in order to obtain protein 
fractions whh reduced complexity or pore Individual proteins' 1 . The sepssatien piu d mU 
can be srreyed in a manner that allows the probing ef protein constituents of cells and 
tissues to uncover sped fie tsrgete. For example, using a combination of anion exchange 
and reverse phase IX. Msdoa^Gurptde ct at have obtained soma 2000 Individual pretefn 
fractions Owt have been utilised to predate mkioawayi that mterrogate cancer cell 
proteomes. fractions thsl reatt with speclfle probes are within the reach of 
chromatographic and gel based separation techniques for resolving their indjvidoal protein 
constituents and of mass epectxomcttric techniques for icentiftcatioa of their constirocnt 
proteins. The LC procedures allow sufficient protein amounts to be resolved for the 
comuuetion of large nambers of tmcruarTsys from a given cell or tissue source. 

Protein oiicroarraya of different types are tikely to become commtrdaDy ovailsble for 
assays of bread sets of proteins and may well rival or at least complement DWA 
micros/raya cc tools far global eupmsiion analyiit. 
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In conclusion, array-based technologies have emerged that contribute to profiling 
tissues at die genomic, eroscriptoinic and pretcomfc levels. Analytical cools are needed to 
mine the vast amount of data generaledt intimately the molecular analysis of cancer at e 
genome and proteome scale will allow better classifscatfon of disease and tailored 
indivutoaifeccl therapy for individual patients. 
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Abstract Novel aad powerful techno logics sots as DNA 
reierearraya sad svoteomles have road* seasfete lot soxlycb of 
'Che capressJoB tcvcb of nodtfpb SjrBCf TtriwtTf both Ib 

beajtb tad cuscesc. lo oonstnatfon» chest tadmoto|to premise to 
iCTuturi&hlu- btology» to partfestor to the ores of mtrtsiiursr 
n editing os they ere opectad to reveal gene nanbtfoo citato 
bvotved to disease progrcsstoa to well os to ptapotet eetrxrael 
targets for drug discovery est dhgnwlke, Hare* ire review cbt 
correal states of these technologies 1 and Mshfi&n ootot otedba to 
which they hove been applied b couccrt to Che ena)ytb of btasay 
sBcdmesv © 2000 Fedcnta of InToncaB Btochredcsl S00V 
ctbs. Published by Ehertcr Srfceee hXV. AO righto reserved. 

Key words: Gene expression profile;. Mbraeney; Proteoipc; 
Two-dimcQsioneJ pi ttosjopboiteu; linmunohistochcniiilry; 
Bladder cancer 



I* bnodUCtfOO 

It b sow e little more iheo 10 yean since the Human 
Genooie Project woe bunched and during thit rcbtfrcly ehort 
period of time there hove bees iccaaxfcaUe ettveooea In the 
ooartnictfon of phyiical and genetic reaps ai wcD at to the 
idcotificauoo of fcacs associated with human dbcascs ft J). 
Soon, the total sequence of the human genome wiD be ded- 
pbered and, hopefully, nude avainbte to researchers world* 
wide for the benefit of mankind. 

Undoubtedly, die Human Ocnorae Project has paved toe 
way to the revolution in the life adenoee that wo ore eapcri- 
enciag today. Oradually, however, ib focus it starting to shift 
to wardi functions! guomics, oa orta of the poM-genemk cm 
thit deals with the functional analysis of genes tad then* 
products (see the article by OoJTeav in this issue). Techniques 
of functional genomics indode rnetbodi for tone cxpiesrion 
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profiling at the transcript (DNA microarrays |V5); set also 
article by Brsma and VUo to this issue); differential disptoy 
(6); serial aoalyab of sseui esvrcssfea P-91 and protolD Icveb 
u^ioteotaks) ([10-12); see also artfete by Andersen and Maap 
to this isflteX m wefl ju tianxpedce |l% phage disptoy |l«4 
pTooedurca rot stustyiog protcto protf in ie^eiYctioos QI5J6]; 
sos aba trtlcb.by 'U^raiB tad Sclig la this issae) and bbh> 
fbrosiUs P7). 

Among the te c faoique s of ftmctioaal gronmirs, both DNA 
nn^roarrtys (P^5) and references therein) and protooorics |I0- 
12] hold arect prowler tor the study of compbjx Uotogtegl 
1900900 with appltcotsora in tnotonibr raedicine. these novel 
end exmcrful fcne oprcssbo profilina techniques fxnuit the 
tntlyas of the uipitssiuu Icveb of thousands of genes mwti- 
taoMuty both in health and disease, these t eUaudu a Jto are 
ooiiipbsucataiy, oOow higtHhioa^kpuv and to eowMaadoo 

1 ipn is i uu data tam^saay bad to a better imdc^indioac? the 
itfutotory events luvu hed to norma! oad disease processes* lo 
addition, these technologies offer 0 systematic approach- for 
seaiching for cflectrfe ta/fetj for drug discovery and diagnos- 
tics. ^ 

Horc, we review the cuircat state of DNA microarrays and 
proteoetici and highbght some studies to which they have 
been applied to coneert to the oaaJytis of biopsy ttwei i nnn 

X UNA edefoanoya 

the amount of information that b now becoming evmttabb 
to fescardters to the life sescoocs ta capfoding, and even 
though the cbtt can be stored to conventional soedto, new 
methods arc being required to analyze targe ecu of genes la 
a rrigj^ttoottgjmut festoon. For thb purpose the DNA anay 
technology was d e v elop ed, the method makes it poasibte to 
survey thousands of genes in porolbi and has several areas of 
apphcatien. One b capressioa mooHorisg (IS}, to which the 
transcript teveb of genes arc measured fat different physiology 
teal conditions both in cultured ccfls and tissues, to search for 
rcffdntory expression patterns. Undrjstanding patterns of ex- 
pressed genes b expected to improve our knowledge of highly 
complex actworka that cross ooonmmieatc in hitherto un- 
known ways, both to .health and disease. Another area of 
application b polymorphism analysis (191 In thb case* poly- 
morphic rrgjoas of the genome arc scanned to search for link* 
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age to d buses, and to renal disease susceptibility genes andf 
or inherited disease gents. A simitar approach has beta mad 
io analyse polymorphic rtpoos of known genes, in particular 
to determine whether polymorphisms tic associated with io 
altered function of the gent product, a fact lhat nay tocmse 
(he ruseeptibOiry to disease Penally, various ottempts have 
been made to utilize ONA arrays for sequencing (K). • 

Below we rmrw the technology currently io use for ixscro- 
arroy-gt or rated gene expression pattern discovery aj veil as 
some appKcatioDS. Tabic 1 provides a set of World Wide Web 
sits that contain useful additional information (sec abo the 
arucb by Braana aad Vito is this issue}. 

2,1. Mkroarrty technology 

Micro arrays arc usually made by dcpnsitioa of DNA spots 
do a solid support like a coated gloss surface, lhat differs m 
. srvcrml ways from conventional tiler-based supports such as 
charged oyton and oitrocettuloic The flatness of the gbts 
tuifase mates it possible (0 to array molecules io a parallel 
fashion, (o) to teuniaiurise the procedure and (iii) to use 8uo» 
resccat dyes for detection. There is eo duTusioo of the eppbed 
msurta) bio (he support, thus allowing focusing for User 
scanning microscopy. 

Two main prccedurcs have been used to product DMA 
chips: pfcotofflhognmhy as developed aad marketed CfftmaxQy 
by AtTymstria lot (Santa CUrm. CA, USA) PbUIJ and mt> 
chanfcal gridding (22). Phmolitfionraphy b well known io the 
computer chip industry and unite an ulbmviolct Ggnl source 
that passes through a mask that directs Io a step- wise csanncr 
where a photochemical reaction <ob^fiuctBotsse syotbcsb) 
takes place oa a sulcesused glass surface. The mask can be 
produced with openings as small as a few oucromctcra a 
big a density of several huadroi thousand probes per i 
ceatimeier of glass. There a, however, an inherent length re- 
striction with this in situ synthesis techaefogy tmnoag am 
probes to about 25 aucf cetida is length. rCgh-dcnsity arrays, 
ea the other hand, allow the use of multiple probes per gene 

m 

Mechanical gridding methods are based oa fas>jet or phys- 
ical deposition of the material using pins ntaniifartuied whh 
wry high preaitoo- There b direct surface contact and the 
transport of small amounts of bqunt makes these systems 
susceptible to evaporation and contamination with dust par- 
tides. The grMdiog {nstrureents use an XYZ motion control 
based oo step engines that can- be coetrourd with very high 
precision. The DNA containing enaterial can be spotted from 
96 or 314 wcD pbtcs to glass ia picdcnued patterns. 

The arrayed probes can be ohgos (photolithography and 
pioolng) or cDNAs (gridding). The hybrid batten reaction 
conditions are quite different in the two cases, and special 
sample preparations am needed to optimally utifizc these 
probes. With shorter probes, Le. of 20-50 nucleotides, the 
sample- b fragmented to avoid tertiary structures and to 
achieve optimal hybridization (18,20). Polymerase chain reac- 
tion (PCR) atop lined probes of 3011-2000 nucleotides usually 
do not require fragmentation of the sample (23). 

The type of glass used as support, the coating substance, 
the coupling technique, the labelling system, and the ftuorcv 
cent labeb used for detection arc all variables that must be 
optimised. A oumbcr of coating substances are commercially 
avaibbte to immobiliu DNA to the surface. Two types of 
aKdcs arc avaibblc that use a 



for printing of amine modified DNA: these irtctudc (t) su> 
bled slides, which contain reactive aldehyde groups thai react 
with antmo-graups via a SchuT base formation as the printed 
DNA dries on the surface of the sSde (avtumbfe from Tele- 
chem Inc., CA, USA; Cell Associates lac. TX. USA; for 
coupling chemistry sec http/Avww.arra yitcoroftaicr oarray* 
covpliogO aad 00 activated slides prepared by the covsirnt 
anactimcat of a bydrepJuuc, polymeric amine reactive coating 
to sitae* bnsocostad sfides as described by Bctcr aed Hohcisc) 
P<) (avaibblc from Sarmodics Inc. MS, USA). AstmsHnrjdi- 
1 DNA attaches covaJently to the actrrated.poryottric sur* 



Three other types of slides are evaitabb that are 1 
snore conventional hum obligation technology rotjtfnriy used 
m nxembranc tnunobitouoo: these foohtdc (i) suaabed slides 
which carry envaJcatry attached primary aaaises oa the sur-. 
fane lhat can fans sonic bonds who the i 
at neutral pH. In aottiucn, the r 
thymidine residues cm the DNA aad carbons oa the alkyt 
ansae of the substrate can be urfuced wrm UV hent^prc- 
vided by companies She Tcteehem lac. CA. USA; " 

A hWh^ VSA tob w fifl 
and jauoobftn abwa pr opes ties of ritooorJulose that Wads 
DNA ia a nen*ovahatt but bfevcrsibfc nsaaner (Srhfrfcrhrr 
aad Schueu), and (o*i) poly-lysunvcoatad slides wench require 
UV crosshnJdag of the DNA (awumbfe from I 
lac. MO, USA; see also tttn/fcagmnjtanforcUc^ 



12. jf^OTirtroThf tkt Mpmbfotm arrayj 

A uncar response that coven two or three orders of i 
oituna to atom Beaded to detect tow and high copy number 
transcripts oa the caste array, la cases where thb b not pes* 
Bbse to may be eeecssary to scan the chip at dfflerent wave* 
lengths* or to anspttiy the sjsyssl wtth aa unassum snauVieb oo 
top of the bound sample (2$, bs the tatter case, the flnt 
aeaanbg b carried out after bybridimtioo of the I 
pie, and the second after reaction whh the tab 

It b necessary to daesnaeat the fiaearfty and wpiodurfbilhy 
sa each step of the pi occc haa, and semethna even from probe 
to probe to obtain reliable ease. Often, a standard sample fa 
I to emanate wftb the experimental saapk aad thb may 
for differences in hybrid isahon from probe to 



2.3. 



arrays and 



Comparison of data obiained from I 
from different laboratories esquires i 
Affytnetrix chips and the custom made eDNA chips use differ* 
cat methods for sumenrdizauoa. The Aft)mctrto chips have 
. aamroairnstcty 20 enrobes per gene and standnrdnmtion Is ei- 
ther cased on the cipicaslua bvcl of selecied grnca, like actin 
and OAPDH, or oo a act ting of the global chip intensity to 
approiimately ISO units per gene an the chip, ta tab way, 
chip data from different caperhncnts can be compared to each 
other, la our hands, the data obtained with the two suutdait)- 
89tioa methods differ only by approiimately 10% (unpub- 



*The cvsiom-made cDNA or otigo arrays also require ataa> 
dardbation, but thb b a comptes problem, la general, the 
standard used often icSccts the purpose for which the amy ' 
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was produced. For example. Tor expression moaitoring of 
breast cancer ecUs, a mixture of breast c&eoer ccfl tines ray 
provide a good standard (26]. Today, however, there b eo 
golden suodard thai can be used for all purposes and as a 
read I, ii b difficult to compare data from different laborato- 
ries aad often it is necessary to use other technologies sucb as 
Northers hybndiattoa, real-time PCR or immunostaiaiog to 
validate the signals. A miaununj rcqutrasfat b that bbore* 
tones thai produce arrays themselves should be able to repro- 
duce data from oac chip generation to the seal based on the 
analysis of «ell*dcfioed controls covering different genes and 
eiprcsson levels. In addition, it is common to use spiking of 
samples with bacterial genes that hybridise to probes, spotted 
for control purpose on the arrays. 

2,4. Sample* for cxpruxion monitoring 

The analysis of relatively homogeneous cefl populations 
(dODcd cell lines, yeast, etc.) has proven much sfanptrr than 
the analysis of tissue biopsies as the totter often contain i 
cell types (cphhchal, endothelial, inflammatory, i 
and connective tissue cells) that ait preseni 
amounts. Standardization may require microinjection of. 
the tissue to isolate specific ceil types [27,24), although the 
ourabcf of oeOs needed for the assay is wcO above a i 
Sampling of specific odl types using teacr capture i 
tion (LCM) (29) can be a fhm»conwim1njg task, and gives that 
mRNA is prone to degradation the pxoceaamg'tiare must be 
kept to a minrnitim IT only a amen amount of enteral is 
available, then a reverse transcription-FCIt step b' i 
for amplification, but Ibis adds aa additional < 
due to the lack of linear amplification of aD transcripts. In 
one of our laboratories we have used preparation of single 
cell types from tumor biopsies to standard rye pooling of sam- 
ples for generating profiles of gene expression at duTerent 
stages of tumor development (rcanuscript in emrpamtron). 

J.J. Bwbtfemtattc ancfyib of expression data 

2.5.1. HUrarthkai ehuttr atgcrhhmx HtearahicaJ duster* 
iog algorithms can be divided into two types: agsjomcxmbve . 
and divisive (Fig. 1) (30). The agajjomcretivc method is a bot- 
tom-up approach, where the algorithm starta with n separate 
clusters (for example 4000 fence, where #»a4OO0) and succes- 
sively combines cfustera unul only one b left The divisive 
method, on the other hand, b a top-dowo approach etartiag 
with one duster and successively splitting dusters to produce 
others. The algorithm used to form the dusters must also be 
defined; two widely used and simple algorithms arc the single 
linkage and average linkage method* respectively. Single fink- 
age, also called nearest neighbor, defines the distance b era mo 
two dusters as the minimum dtaanee over aD pairs of dusters. 
Average linkage takes in consideration the average distance 
ova all (30). 

A distance matrix must be calculated before eturtering is 
performed, and h b the distances between the obtained gene 
expression profiles that are used to form the actual dusters 
(30). Observations with small distannn are grouped together 
as described above, The two most commonly used distance 
mcasurementi are the Bodidcaa distance and the Pearson 
corretaiion coefficient. 



Fig I. Two nab typo of choUruig algorfthma, (fat bcemrthical 
and tat mm^MemjctuBal ■* 



*in is the cacasuiraacot for the ath variable cm sampling unit I 
?i b the variance of the Alb variable (30,31). 

2.12. NoM^lmrxhkmJ chattr algorithms. In nofrbier- 
aiehiorJ cluster enatjeb b b aosismod that the data can be 
divided into o certain number of dusters and that they are 
wdl separated. Tha advantage of thb approach b that targe 
data seta can be clustered mus* f astm tJum by nam* M 
cat e tBrirrta g because a tower number of dusters b enisled. 
The most ccormmo earthed for uon-trirrrjclucaJ cfurtnr anat- 
ysb b taseaas. However, a socthod termed etlfers^med 
maps (SOM) has recently been apptfad to mprearJen dam 
generated tram DHA chip arrays (32*33). The fcmcans caetb- 
od (34) identifies ft points that function as duster centers. 
Each data point b then assigned to one of these ceatera in n 
way that mlnimirr s the sum of the diftannrt between aD 
peinta and their ceatera. Thus* it b the attribution of pomts 
that decides the value of the means. One drawback of thb 
method b thai -a specific cumber of chatera b < 
the number of ctaatcre b usually tmfarow* in 
sets. SOM b assume to the a-eoeens approach, but it baa a 
gcoatctncal ccmfijuratfiQo and the number of oodo predefines 
thb oosilgnmnon, lamafly, dan points art mapped onto the 
geometrical conjuration. When clustering the data with 
SOM the pKfchioa of a node rnigTatcs to fit the data points 
daring auix a aai v e hrrnfrons PUI) 

ZJLJL StipwUt* €tta*gim$tak Cbrnmoa to the clustering 
methods in which array dam ore used b that they are tmsu* 
pcrvbed. La. no predefined references are smowtv An i 
tive option b to i 



Euclidean distance : ED o ^{X^-X^f 



t ice^nres at least two references. For cancer classification, 
for example, the rescrcnees could be the gene oprcssioo pro* 
film from * m **^pI and Invasive tumor tissue* In thb particular 

used to deaarroc^each^ reccnTarttde by 

Oorob aad co-workers (lo), the as than ejmlyced 6117 geno 
using 38 bene rnarrow samples. Based on these 31 samples 
they found that o vector baaed on 10 and 200 genet was. 
snflkbnt to distmgulsb bcrween acute myeloid leukemia and' 
acute lymphoblastic teukemb. Thus* the authors were able to 
construct a cancer dassificr based cm a low number of genes. 
Using ■ similar vector~bajnd classifier approach. Brown and 
ceworfcers (37) analyzed 2467 yeast genes in 79 diflerenl ex- 
pcrimcnts and were abb to classify genes into functional cat- 
egories based on the expression daU from DNA chip arrays. 
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2.6. Applications ofairop* for czprtmn monitoring 

One of the main ireu for amy application is in the Simul- 
taneous monitoring of thousands of transcripts lo different 
biological tcitiap. The approach b bong mad lo identify 
new networks ond to' understand patterns el a pressed genes. 
A numtatr of articles have been published using the array 
technology arming at identifying cUscase-assoaaied alterations 
in human*. For this purpose clinical samples, human cell fines* 
and to a few cases aaima] models of human disease have baco 
used. 

2.6.1. Gtnr €Mprtsswn profiling of time biopsies, cef/ Aicf 
ond animal motUls of disease. 2.6. U. Tissue biopsies, Ocse 
opfcssion studio en clinical samples have ben performed hi 
breast and colon cancer (36,31), as wall as is ethveaderosis 
P*.40J. Genes of presumably' knows functions have been 
identified and linked to the diseases; most of these data arc 
now aveneblc in the Internet (Table )). la a study of breast 
cancer that used dinica) specimens and cell lines* with as 
array containing apptpjuroatdy 5000 genes, Pcrou and eoV 
leagua (26] identified a proliferation related gene duster m 
the cell line that was unregulated in the more aggressive din- 
ica) breast tumor specimens (26). la a siinibttjtudy of colon 
cancer based en the AffrmruU arrays. 41 BSTs homologous 
to ribosooul proteins west found to be uaitguUtad b the 
tmnor tissue (31). In this study, a muscle iadea was used for 
csrrccting tor the stromal oompoaeata as this showed a hfejfr 
iodes in the norma! biopsies. Both the breast tad the ootoo 
capcer studies larerd , a conrJetioa bctwaea gene expiration 
levels in the cell tines aad is the distal specsacas. 

Hunan athcrosderosts lesions from artarfa carous sampled 
from patients undergoing surgery have aba been analyzed 
using the AQyaietria capressioa arrays |40|. Out important 
finning was a flve*fobJ trpregulation of the caify growth re- 
sponse gene EgM, a DNA biodiag proteia that maucaom the 
transcription of genes etiaodiag growth factors, cytokines, sd- 
heston nwlscufcs, aad preeeia* related to coagulation. This 
fading was corraborated by bnjatmohaudthtaiistry aad aai- 
ma) uperiattata, aad identified Egr-I as a possibto target Car 
therapcuttc intervention . 

16.1. 2. CHI lines. Several studies based on expression 
monitoring have been performed ia human cdl lines in areaa 
as diverse as caaoer (4<l> ophthaiasology (4% aad the central 
nervous syrtem HD- Arrays have also been used to study the 
effect of cytokines interferons) (44], cytomegalovirus ufection 
r**H ond oncogene transfectioa (461 on the overall patterns of 
geae cspression. la oae study, bumaa foreskin fibroblasts 
were infected with human cytomegalovirus aad the capressioa 
of spprouraatdy 6000 genes was monitored for up to-24 h 
(45). A tout of 23S genes was found to be vpvegnlsfied more 
than four-fold. These included HLA-B (upregulated six-fold). 



a protein that protcets sgamst cytotoxic T-fympbocytcs; ROf 
SSA (52 kDa protein uRNA; unregulated 12-fold), a com- 
monly targeted eutoantigea, as well as several eorapaaeets of 
the pathway that produce prostagUadia E2 from arucbidontc 
and. Ia aaotber study, h was shown that btatsneat of the 
bumaa fibrosarcoma cdl hoe HT 1080 with IFKs o. 0 aad 
T resulted ia the upieguktinn of aovd genes bnplicated io 
apoptasis (RAP46, Bag>l, scrambtasc), while genes Ukc 
IGF-2 and ZaT*3 were strongly dowortgulaud (441. 

The effect on geae Mui e uion of the fusion oooogexte PAX> 
FKHR traasfectod Into NIH 3T3 cells has also been studied 
using mrcraarrays (461 >* was shown that the ration teat, but 
not the wOd type control was able to activate a myogenic 
trajujeriptioa prostata that inchided induction of a number 
of transcription factors such as MyoD, myogenin, SUI, aad 
Slug. 

2.6.M. Animal model systems. A common approach to 
the study of bumaa diseasa is to use animal asodd systems. 
This hag ban done for a range of diseases iitetuding eaeepha- 
mmydopithy« rymphoma, rcaal tubuli (23], aad tuag fibrosis 
HQ. la the latter case, AffytDctri* chips were probed with 
pookd samples obtained from group of set aaassam to reduos 
variatinns aad oast. Transcripts that were significantly altered 
ia hug fibross (47] included tsliacefhihv matrix aad saflsun* 
matory retpease genes. Analysis of dusters esmtamfn g theat 
genes m a lima course experiment with the SPUIHRB PRO 
3*0 program revealed dutcrcat tssaporal patterns of eapresstoo 
that farther subdivided these geaca* 

2.6J. Tosdtotogy mid stag taring. From a toiiccloxjtsJ 
peat of dew, there are great expeetaaoas for espscssioa taoa- 
iioriag as the effect of drugs, both expected and unforeseen 
sate effects, can be. cnonitored m anunats aad cveatuaOy io 
humans (48^9). Oae problem that has arisen from these 
studies, even la quits f*ff!pflt. modd fynfff^. Is tat often* 
unrxptstnrd changes fas Uaai u ipt levels obaervad. These 
dausgm trt ajuJtt stssro 

plea relationship among geae products than pr ev i o usl y 
thought W 

The oombma tfO D of geae expressioa saonitoring aad testing 
of drugs oa cdl Uass and ia ir*T*f models boats street prom- 
ise. A fecent publication shewed that the varuuon la the liver 

xyenes, glutathione fcguhttars, OKA repair easirmca, heal 
shock proteins aad housekeeping genet is larger among iad»- 
'vsaual animals than that eitraduced by the array assay hsetr 
(4°)- These studies revealed the upregulation of cytochromes 
even at low doses of ^aaphthoflavoaa treatment aad showed 
a gaad correlation between the array aad Northern hybridi- 
sation data. A recent sfifecnmg using cDNA rnJcroarrays of 60 
himan cdl Isaea used by the National Caocer Institute for 



Tobh I 

Vwdd EoU on tbt Worfct Wide Web for anay dsu and software for data analysis 



SUf. t^fc Avauabai 

|l| bupiffbnpp. me^gevAjmpbons/ dsn asd sefhrare 

P) b up JSmrwxst.vacjcd u/wa reh/tonpnl etgtntjfpm biml dats sad mflware 

(6.I2.U.I8) hiipt/Awww^0ietM.«irjai^QnbfPR/dtta^^AtmJ date eed nftwore (CcneOusuv) 

|ll) bitpJAvvw^Mm/Uxa^bsnua4ll»3Atm daa 

(13) biip^rcpjncdJomrd^dufttctworCdisovuy/ tfatt <renlt>) 

|M) bHp://qoanigtnstaefonledar • dsn Ciemhs) 



6 



/£ Grftf 99 al/tEBS Uittn «0 (7000) 7-/fl 



drug discovery, has formed the basis for csubfisbiog a data- 
base that can be linked to gene expression data asd molecular • 
pharmacology fdl.50). 

2.6 J. hfoUarfar tlatjificoitan of dittnstx Oatrifiiattoo of 
disease cao generally be achieved by class discovery asd class 
prediction |J6|. Class discovery refers to the identification, 
based on gene expression, of previously unrecognized sub- 
type of the disease. If cfiateaJ foUow*vp material as available 
these classes can be related to sips and symptoms* disease 
course, treatment outcome, and mortality. Class prediction 
refers to the ability to assign a particular patient to an already 
defined class* based on molecular nomination of the diseased 
tissue or other specuYicna. 

Two recent examples have demonstrated the power of ex- 
presrloa arrays to classify hematological malignancies. In one 
of these studies, leuhemias were classified as AML and ALL* 
mutoiniy . based on SO genes selected from an array carry- 
ing 6117 gents, lo this study, 36 out of 98 patients were 
correctly classified, and two were uncertain The SO genes 
used m the array included some that were known to differ 
between AML and ALL* as weO as new oarken f)6> 

Another study classified feVoeO lymphomas mto two caotce* 
ularly distinct etaeso that reflected different stages of B-cefl 
' duTcrcntiation, the gcrnuaal center-tike and activated B-cell 
bxc lymphoma groups, respectively. For this study the tyro* 
phncbip oucsoarray from Researcli Genetics, which holds 
17096 genes, was used. Follow-up for 12 years showed a etg» 
nttcaat difference in turvfvsJ among these groups pi), 

It b eovisaged that global rurveyi of gene caprcsrion win 
identify marker genes that, may be used to group patients into 
motecularfy rttcvaat categories; these charters arc cspectad to 
greatly improve the preejrien and power of clinical triab; 

2.7. Conthufons 

There b no doubt that the massive parallel gene capitation 
• information generated by rnicroerrays sriO have a major cm- 
pact in the discovery and understanding of patterns of ea* 
pressed genes. In addition, the technology b o peeled to gen- 
crate novel and effective targets for drug discovery and 
provide, in combination with protcomtcs, valuable toots tor 
the entire process of drug development and evaluation. 

One of the main challenges wc foresee in the future wal be 
to solve the problems posed by the analysis, interpretation 
and access to the targe amount of Information that will be 
generated. Large studies of EST* have oot been pobtbhed yet 
as dam analysis requires the dt vd o pi ucut of new csocsfor* 
ma tic tools that can deal with the' huge amount of informa- 
tion that b being created; These studies are capwrtrrl to Iden- 
tity new genes of importance to specific biological pioueues 
and reveal new regulatory pathways through the enaJyib of 
the expression levels of individual BST*s is large nurnbers of 
samples. 

lo the future, coupling of expression monitoring to trans- 
genic animal models may prove to be quite rewarding as the 
global effect of a gene knock-oat or knock-in can be moni- 
tored with both mtcroamys and proteomies toots |I0). 

3. Proteomies 

A complementary technology to DNA mieroarrays for 
monitoring gene capitssioa is provided by proteomies, a 
term generally used to cncapsulsts oD of the technology cur- 



rently available to analyze global patterns of gem expression 
at the proltia level totems are frequently the functional 
molecules and, therefore, the most likely to reflets differences 
in gene expression. Genes may be present, they may be mu- 
tated, but they arc oot nerxxsardy transcribed. Some messen- 
gers arc owuxribed but not trentbtedL and the number of 
caRN A copies does not necessarily reflect the number of func- 
tional protein rnotocuhs (Sty In addition, protevmio adV 
dresses problems that cannot be approached by DNA analy- 
sis, namely, relative abundance of the protein product, post* 
translation*! modification, subcellular localization, tumovcr. 
mtrtxctioo with other proteins as well as functional aspects. 

The r^otcome has been defined by V/Ohios and colleagues 
as lbs complete set of proteins encoded by the grtrcmj ISJJ, 
and recently, the terns has been broadened to include the act 
of proteins a pressed both in epaee and tfaave. There are two 
main armroathea to pr nt n o anea : one b the exjresmon model 
in which an proteins are analysed, and the other b the ceO 
map model in which only a stirred an of proteins* Eke com- 
plexes and organelles, are studied (54). 

The protxomie technology b complex and oo inprisu a 
plethora of stats of the art techniques to rexorve (high rosn* 
hitioa tw^dinxccaional gels)* quantitate (pbcwnlurrisnagsr, 

qumdngj mats spcc^uuctryX as wcD as to atore {twot^nsen* 
atonal rnjlyarxytamlde gel deempkorcsss (2D ■AGE) data- 
bases; tXty'JJtWbMMkh&hWai}*; bttp^cxpasyJee^s^jch/ 
sproi/r^rot-tophtjeJ) ccmrrrankutt end fotertmk r^oeesn and 
DNA sequence and mapping m/ormetien (biourforcnatics) 

po,ius-sq 

The usftumws of the 2D PAGS tnchniqtte tor a v g ^ a cal o 
protconss projects depmda very tsuch on the number of pro- 
Jcifts that can be resolved in a complex protein mixture, for 
example a human ceO. rvoteome profile data from a few lab- 
oratories* bcludtni one of our own, have bsdtated that only 
a fraction of the human genes are twhehtef^o Id n given cell 
type and estensfwj analysb of whole ocO extracts* organelles as 
weO as partially purified subcellular factions, suggest that 
individual cells may oot aprcss enore than 6000 primary 
translation products |59]. To this nsxnbcr one haa to add 
the post-crenslaeloael processing and chrmrral us o rfifrairo ns 
f^hosxftorytetion, glycosytatioo, denssthytation* escfysxtion, 
ayrbtoybtioo, fuUmJtoybtioo, sulfation, ubiquination, ct&X 
the latter being rather common and eatcnaivu in canny pro- 
teins 160,61). Thus* an we stand today the ID FAOB tachnot- 
ogy is oot abb to resolve and depict b a single gel aS of the 
proteins thought to be present in a tnaramntian oefl. 

Currently, there b a great deal of interest in piutcumio as 
epuficntrons of this technology art expected to reveal gene 
regulation events involved b disease progression aa weO as 
potennal targets for drug ctawety and cUagnostica. More- 
over, the technology b bound to have a great impact te agri- 
culture, toxicology and the industry io general. 

J./. Ptotttn stp&etiam: tht 2D PAGE tedunhgy 

For the past 26 years, high resolution 2D PAGE has been 
the technique of choice for analysing the protein composition 
of cell), tissues and fluids, as weO as for studying changes in 
global patterns of gene expression dieted by a wide array of 
effectors (12,62-64). The technique, which was originally de- 
scribed by O'FarrcD (65,66] and Klose (67J, separates proteins 
both b terms of their isocteetrie point (jpi) and trabecular 
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weight and provides the highest resolution for protein inaly- 
ris. Usually, one chooses a ttroditioo of interest, for example 
the addition of scrum to aoo-diffcrcntiatcd human keratmo- 
cytes. or compare ooraal mad transformed cefb (Fig. 2A,8), 
aod let the cells reveal the global protcio response as aO de- 
tected protons cao be analysed both qualitatively (pesMraaa- 
tstionsJ modificatioas) and quantitatively (relative abundance, 
co-ordinated expression. Fig. 2Q ia relattoo to each ether 
Q 55,6*] and references therein; see also http://biobase.dk/ 
cp-birVcdh). 

For tnany yean the 2D PAGE technology relied oa the uae 
of carrier ampholytes (amphoteric compounds) to establish 
the pH gradient, but ibis technique has proved to be difficult 
because of the lack of re prod uti bully created by uncontrol- 
lable variations ia the batches of ampholytes used to generate 
the pH gradient;. Lately, however, with the ratfoduotiaa of 
immobilised pH gradients (IPG*) («J0L which are integral 
pan of the polyacfylamtde matrix, it has been possible to 
obtaia focusing patterns that can be casOy it pic d ttt ed by 
the Borxxpcrt. IPGs avoid some of the problems associated 



with carrier ampholytes such as cathodk drift and i 
sis, allow a hi|hcr loading capacity fox cnoopreparstivt runs, 
and pra vide btcteascd charge resolution amen narrow pH gra* 
dieots (0.03 pH unlt/ca) arc used <P0.7I| aod referenoo 
therein} In our hands, however, carrier ampholym P.S-1©. 
Fig. IA) and broad range IPOs (Fig. 3B) resolve tmtflar rron> 
bcr of pSImcthioaiae bbefled pufrpMstMes (about 2500} m 
dhuuattd with the separation of whole proucm catracts from 
bbeSaJ human a*raJmocytcs |?n It has been proposed thai 
narrow range, overtopping IPO gradients viewed aido4y>s{de 
may provide a tohstJoa to the problem of resolving and de- 
picting the proteome of a given erU type. Recently, however, 
Oarthab and co-workers found this solution unrealistic; as U 
will require the running of a bugs Dumber of gels (711 

Very basis p uly p aptf d u have proven difficult to resolve, 
Bllhoogb both earner ampholytes (oon-equilibrium pH gya* 
dieot emorophoresis, NEPHOE) (60,731 and IPOs (9-12 and 
4-12) (PO] and references thereio) have boss showo to sepa- 
rate basic proteins. 

One of the most bnpoitaat steps io the 2D PACE technal- 
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Fig X Nro-euJtured oenmJ human temunoryu pro (dm vparalad toon) (A) cersvr anpMyio p.S-10) end (9} IPOs P-tO). A far pretrfa* 
on lodicoted for reference. B b froo Bjtflqvxst ct at |72V 



ogy concerns sample prepare tion n very often tome proteins 
cannot be property dissolved by the lysis solution originally 
developed by OTarrcll (65). Thus, there is pressing need to 
develop proteeoU for optirorjng sample sotobaisation, To- 
watds thh aim, Rabilloud tad covorket have eosde use of 
the high loading capacity of IPOs to resolve tmsnbrtns pro- 
teins for struct uraJ analysis, and fo dotal so have unproved 
their solubility by using a cornbrnstion of detergents aad cha* 
otropct [74,75}. h has bora shown that the addition of thio- 
urea, CHAPS and ivtfobctaia surfactants to the lysis solution 
containing urea results id a much improved sohiWfimtioo as 
wet) as transfer to the second d uumsiuJi SOS get As far as 
nadcar protcau are conacre cd, Oerg and cofleagusa have 
unproved cooedereWy the anarsoen of very baste proteins 
by first pTcdpluUag the samples with acetone prior to toruhl- 
bxatioa in the lysis solution (70). Its problems associated 
with the extraction of tissue sentries, on the other hand, are 
much more ensptea and have not been addressed yet la a 
fysianauc fashion. 

3.1.1. Dneetlwi An important limitation of the 2D 
PAOB technology Is the lack of very scusrrJve procedures to 
deled those proteins that are present In vary low abundance. 
In addition, detection procedures are needed that can be ap- 
plied to a large number of resolved proteins whoso abundance 
may span through seven or eight orders of msgnitude. 
dearly, the icraitiviiy of sOvcr nitrate and Cormunsft Blue 
suuoiag is inadequate, and on)y mt tabetic labelling with spe- 
cific isotopes may reveal enough proteins to warrant proteo- 
e&tc project*. Furthermore, the use of phosphor-irnagiag tech* 
oology may enhance the sensitivity and Uneasily of det ect ion . 
Limitation* of the radiolebdbag approach include CD be* of 
labeling of some proteins due to low turnover, (H) problems 
assoctsted .with safety regulations aad disposal, and (Ui> diftV 
cultics to obtaining fresh human biopsy material for rebelling 
experiments. IdeaDy, one would like to have a highly sensitive 
fluoicsoeacc-bssed proteio detection technique able to support 
aJl types of studies irrespective of the sample, or the cad point 
of the analysis. Preferably, the dye should not alter the mo- 
lecular weight and p/ of the proteins if St is to be added prior 
to electrophoresis, and should support quantitative studies 



mvorvtng proteins having extreme duTercnao in their copy 
cumbers. Urfortuastejy, no such ideal dye is available on 
the iip^t* yes» ejflswgh Oxford Ot^fejoScsesscc has developed 
fionroont fPO-PAOB (htt>:/>v^ < oy AC^FOtwux/ 
iMmtbtoT), a coshnnlogy not available to the serrrit m c coxn- 
muoiry. Bautatam compoixnds such as SYTRO Orange; 
SYPRO Reel aad SYPRO Ruby have bees used to analyse 
whole pmteo ryeates from bacterial aad mammalian cells, but 
their seasKMqr (1-2 ng) is slightly lower than that of saver 
nitrate |76,77). Some ad vintages over silver staining Include 
short staining tima and the fact that the gets do not need to be 
fixed prior to staining. Id addition, Bttte or no drttamtng to 



of 

docs set pose a problem as Cells aad eo workers have shown 
that 2D 'PAOB bnmunoMotllag id uimbUiiUoo with en- 
hanoed chrrnllwTiunmccncc (ECL) can detect as Bub as 100- 
S00 aasleeales per cell m unfraetionated ceOulsr extracts [SO). 

3.1J. Quantltailan. Even though there are several toob 
available for the quantitation of protein spots, there is at 
present no available procedure for quaatitatwg afl of the pro- 
teins resolved to a complex contuse. Part of the problem ties 
in the large dynamic range of protein expression, tack of sea- 
ototlon, post-transmtienal modifications, staining behavior of 
the proteins, as weO as in the fact that many abundant pro- 
teins streak over less abundant components mtcrfcrmg with 
themeaiufteoenta. At present, fluorescent technology seems to 
be way ahead; as with the fluorescence stain Sypro Ruby 
there b e linear icsponsa with respect to the sample amount 
ova a wide range of abimoance (P7| and references therein). 
Quantitative fluorescence ouaxurezscDti can be performed 
wftb OCi>eamcra based systems as well as with laser seaaner 
systems Q77) aad references therein), lo lome cases, fncbosa* 
beffing in combinatiaa with scintillation counting oflcrs a rea- 
sonabte alternative for qeaatitaung a smsO number of pro- 
teins (6s]. 

3.1J. tdutt&cattotk Methods of protein identification 
have mended tracauaoblottuig f78.79J, Edman peptide se- 
quencing fJBO.St) and reference* thereto), arnlno add compo- 
sition (B233^ and more recently the use of matrix-asslsted 
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bier doorpucn/i cm nation (MALDJ) enaas speuiuuury (84) 
and dectrospny ionoabon (ESI) lift The outer tafrmquea, 
which rely en the comparison of peptide mus fingerprints, arc 
fast and require only pioomol amounts of prc4rins. limited 
peptide sequceciag can be perforated swag tandem mass spec- 
trometry or post-source decay, but h it Is stfl) not possible la 
make N» and C-tenninal sequence identification. For a review 
on Edman peptide secjusocing see referenoe (81). 
aad Mano review protein identification by i 
elsewhere id this issue. 

JJ. MaJdnt wmprekouitt 2D MB database* 

Advances to hardware and software development (Elsie, 
OeBsb, Melaale, Quest. Tyeho aad Kepler) for scanning 
aod image analysis of 2D gets, as wcD aa the developmcni 
of sensitive toob for rapid protein identification, have cata- 
lysed the caiabushment of cosaprsoansivc prouoaue 2D 
PAGE databases (64,86). Ttae databases aim at btcrfacing 
proteia information with fonhcomiog DHA snapping aad se- 
quence data from genome projects* and offer a global ap- 
proach to the study of gene esprasien both m health aad 
disease (hn>y/biobase^s^hin/ecUs; http^/eapasyJbcugexhf 
sprol/spTOWop.btmf) Q 10.95} aad referenccf therein). Besides 
annotating genomes, these databases arc expected to address 
problems ihsi cannot be approached by DNA aaalytB, 
namely, rcbtivc abundance of the protein product, posvtraos- 
lational modificationt* subcellular localization, turnover, in- 
teraction with other proteins aa well aa function ai aspects. 
The first prototype databases were built by Cefis aad Bravo 
{87|. who pioneered the use of protein identification tech- 
niques to establish comprehensive JO PAGE dsubascs (88) 
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Ftg. 4 shews a region of the synthetic enaster 2D PAGB 
image Cuoetectrfc reusing, IEF) of I^SJewt^^elaeetod 

^dbsTm'te World Wide Web (http^obaseCdh/tgl b W 
cehj). PnvdDM bagged with a red cro conespond to knowo 
polypeptides that have ban by one or a $ 

ties of techniques that inchide 0) ZD gel 
using tpcciftc asrttedsa and the ECL < 
00 miereaeqittntelBg of C owman*, biunant Bhre stained pro- 
teins fjKflt (Hi) mass spccuomctiy of tryptic peptides (90) and 
r» condgracbn with smown human proteins (mdividual pro- 
teins aad organelle coaponentJi) aad (v) overlay t jTfmsanrt 
\59] and(vi) transient caprcssloa in mammalian cells pi). 

To date, 1237 polypeptides have been identified as tbm data- 
base (IEF and tfBPHOE) of the 91 J» that have been resolved 
aad catalogued. la addition to ^S^ethionisit UbeDod pro- 
teins, the database contains a Tew polypeptides that lack eae- 
tMonirm, brt are revealed by silver staining, Cootnae^bra- 
Gant Bide or by tabeObg with a misture of Itf | M C^mroo 
acids. It should be stressed Chat some protons migrate both 
in IEF sad NEFHOB gets (for exempts, Orenotase, tnose- 
phosphate uomense aod elongation lector 3) and they easy 
serve as landmarks to alien the gets and as refercnoes to oar- 
malms cjuaatf tatioftS in both pH directions. 

lafoimitioa gathered en any gtvea polypeptide, known or 
unknown, can be retrieved by clicking on the coropemdmg 
spot, in this case le-VJ sfgzns. also known as stratifin (Fig. 4) 
(92). A file containing all of the ^formation entered f or this 
particular protein, mostly obtained from eapesirneate per* 
formed in our tsboraioiy. ts shown m Fig, * (only a traction 
of the file is presented). FH» for known proteins contain links 
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to a subset of Mediae (http ://w\wjjcbiniro -oih.gov/ 
PubMed/), Swli*-Proi (hup ://cx|Aiy^oi9exh/sprol/ipro(-4op. 
html) and PDB (bttp'7/i^^b^hoddbcTi^c^faO. Other 
boks include OMIM (hUp jtwww orhi nhn nih aw/ OtnfenA 
OeneCardi (http^/btoiafofmatia.winninii ex it/cards), UbJ- 
Gcse (http^/ww«^btJitai.oib«oirA/niOcBtrmdc3uhtmI) and 
etbv Web rites such as CySPID (cytosketets) protein dale* 
base; blip 7^aeUajt»d.yaJexdu/ ~ panaer/cytc^b/lndaa. 
html), metabolic pathways (compiled by KEOO: httpif 
www.acoomeAil.jp/UssOt tbt cytokine explorer (blip:// 
fcboi.opaJTi i isottiicdu :443feytokinqfeapfayo > .bBnlX, histology 
untie* (blip ^/biosan.biobsse. dk/— » pdiJ^uuJii/oiiuugjBpas. 
btraJ), etc. Io the future, aa new databases and rented Web 
sata become available, h wO) be possible to navigate through* 
out various datahatn containing coasptemcetary irformaoon 
(i.c nucleic acid and protein sequence, genome snapping, dis- 
eases, protein structure, pett-tTansmtmaal aodificnUena, aaaV 
bodies, rignaumg pathways; hiitotoiy, etc). Clearly, data- 
bases allow easy aoecn to a large body of data: one* a 
proiefo is identified bi givto databases, all or the isfomatioo 
d be easily r c ti i c vc d and made svattebfe to the 



Today, 181 Information categories are available ia the 
World Wide Web version of the kcratfoocyu datsbase. These 
include ccBular localisation, pathways, pro terns aftocttd m ' 
psoriatic fcentioocytes, proteins expressed ia oonnal urotbt- 
bum, keraiioocyics, fibroblasts. He La oclU and Madder caro- 
noma ceD tine RT4, revets in fetal bumaa feiisuts, partial ami- 
no acid scqu cooes, sounds cot, cytosacfetal proteins, caJaunv 
binding proteins, aancains, chapcronins, beat shock proteins, 
etc 

Functions to query the databases include starch by name. 



• or keywords (Fig, 4), molecular weight and 
pf, u wen as by orftanefle or oefiular cenopemcot By c&cking 
oa any ofihe OTgancQcai cellular structures or touiuunc ota It 
b possible to eat a protein Cst as well aa their relative pesi- 
tiooj en iho tnattcr anogo. Id adiiWoa^ o« cm rctrtt^ a b^ 
of aO known proteins man^.b the daiafaasa. Motcovcr, wc 
provids protocols and videos of preparative steps thai can be 
used to reproduce the data displayed as wcO as a gallery of 2D 



Airbus 2D PAOB databases available at http i/c4obasc^k/ 
cgWn^cepj tgetude tranritfanal cell carcinoninj (TCCe). sana* 
mous ceil carcinomas (BOCaX urine, fibroblasts, and moose 
kidney cefb. Other databases available fas the Internet can bo 
found to bttp-Jtap*jy.bi^^ http*J? 
wvrw.aairnrhl ntbamo nhfrohfahli/jbrotcsn; and http^asxr* 
pagutbmfoJfo+triiaM~iMnMhAJbvtiL Abo, several 
proteomic loots for protein cdeolificatton and characters))* 
tioa, primary structure analysis, secondary structure cvodm* 
boa, tertiary structure ami DMA trenstatjoa into protein ere 
available at the EXPASY proteomre server (http^rwvm^apa- 
syjobrwww/toob). 

J J. Application* 

To date, there have been thousandi of reports inustrating 
the usefulness of the 2D PAOB and prottomfc technotogjes ia 
many areas of biology- Because of apnea txmiintiana, however, 
on*/ a few of the application areas arc highlighted below: 

IS. I. Qnurr. A great deal of research has been devoted 
to the elucidation of the pathways that control ceil prolifer- 
ation in normal celts, and hence, the detersss nation of the 
means by which attentions of these pathways bad to abnor- 
mal growth characteristics anoVor imtpfattit transforrnation 
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and caocer. Most studies ban focused on oucotcees, tumor 
suppressors, cell cycle regulated proteins and final transduc- 
tion molocuto id various ceD typo of different species (9V96), 
but only in a few cases (here have beco systematic attempts to 
analyze the protein pbcootype of pairs of ooraul, and trans- 
formed cct) types using a proteomic approach |Wj68.97-99). 
So fer, ooly very few studies have made use- of biopsy material 
due to problems related to toe cell bctcrogactty. 

Already io 1982, Gelis and cd- wart en started a proteomic 
approach to the study of ccU inasfonnaiion urioi dosed ccU 
. tines (68,97,971. Their rtsuhs showed that transformation re- 
sulted in the abnormal expression of oormal genes, rather 
than in the expression of new ones (98), In addition, their 
studies raised a word of caution concerning the widespread 
use of protein information derived from studies of different 
cell types from various species. Today, we ait.vcfl aware that 
cultural ccftr undergo iBportut *frfrgri wheq pUecd la cut* 
tore due to different ennrotuneotaJ factors and growth eoo- 
ditions 1 1 00) nod accordingly, current efforts using the proteo- 
mic approach are being diieeted to the study of cot-cu) tared 
cdb and/or tiuue biopsies, Among the cancer projects cur* 
really underway, those centered en leukemia and hematology 
ice) malign Boczctv breast cancer, ootorecttl casner and bladder 
cancer are bricSy mentioned below. 

J.J././. Lndtanb and hemaiotoftat maHgnunekx Studies 
by Hannah and colleagues en chMhocd leukemia and other 
hemstotogreal marrgiindes have yielded so far several 
markers that mctude Opl8. also known as — an 
oncoprotein that has been mtpficnmd m signal transduetrotj 
{101.1021. In cbiMheofi toucrma, phosphorylation of OpIS 
was shown to corrctaic with a high content of otfls m the S- 
phasc suggesting a role in prevention. The group aho 
identified om&»HI (nucleoside diphosphate kinase AX a 19 
kDa proteio that tc uprcavlated m normal lymphocytes 
treated with mitogens as wdl as in leukemia eclb from 
patients with acute leukemia (101). 

J.J./. 7. Brtnn cancer. Systematic studies of clinical breast 
tumore of different histopathotogical types by Franseo and 
oo-warkera (104-106) have revealed several proteins, including 
PCNA, hspoA bap90 and calretlcuun that are highly deregu- 
tated io invnsro caranonuu and that may serve as prognostic 
flssAcrt. These studies have made use of fresh cfinical tumor • 
tissues of different subtypes and have paid special attention to 
sample preparation. 

J.J. /.J. CoJortetaJ cancer. Studies 'of Juaghlut and cot- 
leagues J 107-109) oa sets of tucroscopicaBy oormal coles 
mucosa and colorectal carcinomas have revealed several pro- 
teins that arc deregulated in the tumors. Dowcuugumtsd pro- 
teins included the liver fatty add binding protest, the smooth 
musete protein 22-ct, and cyclooxygeoasc 1 Uprofulated pro- 
teins included the heat shock protein 70 as well as several 
members of the 5-100 family of caJefanvblnding proteins (5- 
I0QA9, S-100 Aa\ SI00A1I and S-I00A6). Some of these find- 
ings have beco confirmed by umvunohistcehcnuoBl studies 

JJ.i.4. Bladder cancer. Gclb and colleagues (110-112) 
hive explored the possibility of using prottomc expression 
profiles of bladder tumors as rwgerprmtj to subctassify hiito- 
paihological types, and as a starting point for searching for 
protein markers in at may form the basis for diagnosis, prog- 
nosis and treatment. To achieve these goals they have ana- 
lyzed the prottomc express too profile of hundreds of fresh 



tumors as weD as random biopsies and cystectomies |l 10-1 12], 
and have established TCC and SOC proteomic databases thai 
may provide a solid irtfrastructurc to support future snxeCes 
Q> 13); bttp^/bubase^k/c^bii^cclis). Io the long run, a snec* 
tical goal of these studies b to identify a complete set of 
protein biomarkcrs that may be useful to classify hJaosaiho- 
togieal grides, asd that will provide with specific probes for 
the objective dbgnoas, prognosis and titatretot of these le- 
sions. So far, these studio have revealed markers for TCC 
progression (1101 a marker in the urine of patients bearing 
SCCs (114,115), and have ted to the development of a novel 
strategy for identifying prrmalignim squamous lesions (1 12). 
The approach makes use first of proteomic technologies to 
reveal a nd ide ntify proteins' that arc differentially assessed 
to jprurc SOCb and normal orothetlum. Thereafter, specific 
aattbomxs against the differentially csprcssed proteins are 
used to immunosiaia serial cfyostet sections of biopsies (Im- 
muaowalkmg) obtained from SOC patients that hove under- 
gone removal of the- bladder due to Invastve disease (cystec- 
tomy). Same bladder cancer b a field disease (lid) - that b 
targe pan of the bladder lining bat rbk of developing disease 
-H b espected that the uromcfium of these patients may 
olubit a sp uc trum of abaormafidm ranging from metaplasia 
to mvastve disease* 

3.3J. Htm dteam. Heart faihire b among the leading 
causes of mortality t» the Westers Hemisphere and therefore, 
efforts are being devoted to (he ctactrtaHon of the ototeculax 
events tcadbig to csudme dysfunenon QII7) and references 
therein). So far, resenrch oa dilated cajo^omyopatby (DCM) 
(117-1191 has revealed thai s^roamutety 100 proteins arc 
deitjulsied, mostly dow njtgu latcd» in DCM ea compared to 
their normal counterparts. These i nclude cytoakcfetal and my* 
ofibrfBar proteuxs, pcdypeptides associated with mitochosdria 
end envotved in energy pr o duc tion, as well as proteins esso* 
dated with the stress lupuusc, These studies have been expe- 
dited by the cstablishmcat of proteomic 2D PAOB dambases 
of the human heart (vestries* and atrium) |I20) 

In ad d iti on to the global analysis of protein uprcasioo 
patterns in * n *"T*^ heart i*rffTfnt cardiac antigen oprcsmon 
following cardiac tmiipuufiattoo has aho been studied using 
twtrriqiits from nxvieomka For rrantpsr, ustag 2D PAOB 
(cardiac proteins) fas combination with Western immumscaot- 
ting (paebnt sera). It has been possible to identify acm'gens 
that react with autoantibcelrcs present both in DCM 
(121,1221 and uijucajdlt b (123}. In this way, antigens associ- 
ated with the antibody response that may be Involved la acute 
or chronic organ remetion have been chjuuetenssef. 

Froteomie studies using anhnal models of heart disease 
have also beco carried out in em-effort to unravel the tnolso- 
umr events leading to cardbc disease. Reeeatty. two olflerest 
studks mvolvmg large tuumab, c^sm>induoed heart failure In 
the dog (124) and bovine DCM (1291, yielc^ ixmflar risvrts o 
those observed in human DCM. Intensttngiy. the most strik- 
ing result to the bovine DCM study was the finding of a 
seven-foht decrease in the cspiession level of ubtquiiin O ter- 
minal hydrolase, as inappropriate ubiquinatton of proteias 
has been sugpsted as an ctbtogb factor in heart failure |I26). 

J.J.i. TaxkBtofy. Changes m the environment as well as 
the growing utcrcst of the phaimscevtieaJ industry have 
stimulated the development of novel testing approaches based 
on die recent technical advances both fai genomics and pre* 
ttomies. Pioneering studio by Anderson and Anderson (127) 
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Fig. 4. 3D FAOfi (Au4B)istf chip data (C tad D) ea lh* 
(grade ID, Tl). C and D: Tbe cop raw thaws fiactiea with 
B: Immuioblol of lire pmriai rashed la K reacted with 
products ii tndfcatcd for rtfrcsae. F: MALOJ-TOF pcptiO* 



«T kenrun • to a aos^avnsho (grada H. To] 

itcb control | 

acjrimt kcmtiD & The position of krraua • and of oao of lis I 
it of tec ftxretb B dn^dkUoB prate* fadfcueri bAtsdB. 



have highlighted the nwftitirm and potential of the proteomxe 
appnnch lo identify quantitative changes hi rat Uver cxprcs- 
eioo profiles aismiated with toxicity of drug* and other acoo- 
biou'es. The data, which are being systematically stored in the 
rodest molecular effects' database, arc expected to yield Im- 
portant ufonnadou as to the molecular mechanisms under- 
lying toxic responses. Likewise, the potential of protoxrmtta 
have beta recently exemplified la sty dies of glomerular acph* 
rotoxicity ia rats {1 28], and of stimulated occupational Jet ftid 
exposure ia miss ling 1139). hi particular, the studies of Scen- 
es** group have shows a remarkable corretatloa between de- 
creased levels of calbindin D-28, urinary calcium wasting ia 
the urine, aod iotra tubular corticoTsoduOary caMfuabons in 
the kidney of rats and human treated with cycl os porin A 
l»30J. 

In the future, proteoralcs b a high-throughput mode it 
caponed to have a major impact in the pnxJioJca) safety 
testing of drup. Toese studies will be facilitated by the estab- 
lishment of 2D PAGE databases of frequent target tissues 
(kidney, liver) as well as of cell lines and fluids. 

J. 3. 4. Nturohgieal disorders. The Creuufetd-Jacob dis- 
ease (OD) has been the subject of intensive analysis itstog 
proteomics. These studies have led to the ideoiiAcaiion of 



two meaben of the 14- V) family of proteins to the cerebral 
spinal fluid (CSF) of CID pattenta (131). Tbe jmmet of these 
proteins in the CSF has been used to differentiate CID from 
other dementia both with high sensitivity and specificity 
(113,113). These proteins, however, arc present in the CSF 
of patients suffering from other neurological cbscrders not 
revolving dementia, Kxmtiog its dmfaal varoe (134.135). 

3,4. Coxtbatens 

. Today, there is oo tacbnology in sight that matches the 
resolving power of 2D PAOE, a technique that win continue 
to enjoy a centra) position in pro tee mis projects for some time 
in tbe near future, There is considerable room for improve* 
ment, however, in particular as far as sample preparation and 
solubility, choice of pH gradient and detection methods are 
concerned. Abo, wc need to improve the separation of very 
basic as well as very low and high molecular weight porypep* 
tides, m general, one a poets researchers to first use wide IPO 
gradients to obtain an overview of the proteome profiles, and 
then proceed with a more detailed analysis using narrow pH 
gradients, which provide higher resolution and sample load- 
ing, thus increasing the possibility of visualizing the lesser 
abundant proteins. The latter can be fadSitsted by the avail* 
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ability of specific antibodies, as weD as by the us or extra*- 
lioo procedures and subcellular fractioaatroa methods cur- 
rently at hand (136-1)8). 

Thar are t\H\ tnaoy additional challcages. however, thai 
nun be addressed before a complete Hum so Protcome Pro}- 
eel can be implemented (139). These include: automation to 
allow high-throughput sample analysis (I40}> improved quan- 
titation capabilities, belter instrumentation and software for 
peptide sequencing using mass spectrometry, -more sophisti- 
cated image analysis systems Co support pi comparisons 
and daubasiflg as wtD as improved btotoformatit capabilities 
overall (12). In addition, we Deed to deal with the problem of 
tissue ocD heterogeneity as marc and mom protcensic projects 
w0l make use of biopsy material m the future 

4L Transcript and pretela hrrets: DNA micro arrays and 
pro iconics applied so she came rrr^p^rt 

As mentioned in Section I, both DNA aacroarreys and 
protcomia arc complementary tnchao logics, To date, how- 
ever, there have bom only a limited number of studies in 
■which both tecboologics have been compared by. applying 
them to the same sample (141,142). Notably, the pioneer stud* 
ies of Anderson aad Seuoamcr (SJ) showed that there b not a 
good correlation between mRNA and protein levels to human 
liver, implying thai geee-based caprcssioo data may be of 
limited vihie m the process of drug d hcu v ciy . The study, 
which compared (he levels of 19 gene products, yielded a 
correlation coefficient of 048 between mRNA and protem 
abundance, a value that b half way between perfect and so 
coriclatioa. 

Recently, 0rnteft ct ai (manuscript in preparation) carried 
out a oucroarray and protcomic study of Madder * fat 
which they compared the transcript and protem csarcsnion 
revels of pain of noo-tavativc and invasive lav grade fresh 
TCCs. Even though they could only compare the levels of 
about 40 wcJKruotved end focused abundant proteins, it 
was clear that in most cases there wis a goad coriclatioa 
between transcript sad protein levels. Only in a few cases 
.they found discrepancies, and in soma of mom instances * 
they could net clrminatr the posxibfliry thai tab was due to 
messenger stability, post-trasoripliooal cpfetng, po4«-cransla- 
tioasl modifications, protein focusing prebtesna, degradation, 
as well u the choice of methods used to assess protein ca* 
predion bvcb (staining versus radioubclliag). For example, 
in one tumor pair they found that the levels of keratin 8 
transcripts were much higher in the invasive tumor (compare 
Fig. 6C and DX while the protein levels were asuch lower 
(compare Fig. 6A and B). Immunobtotting analysis uamg her* 
a tin 8 specific antibodies revealed thai the discrepancy wae 
due to degradation, as several related ni od u c ts of lower ap- 
parent molecular weights end mere acidic p/s, could be visu- 
alized (Fig. 6&> The identity of one of the crossreecting 
peptides (indicated with arrows la Fig. 6B) to keratin 8 
was further confirmed by MALD^tune^-fiight (FDF) (Fig. 
6F). From these studies it was dear that when comparing 
mRNA and protem levels there arc other factors that 
need to be taken into consideration when interpreting the 
data. 

When comparing transcripts aad protein aprcssion profiles 
of matched sample pairs one often gels the cmprcssioa that 
there are more changes in the abundance of the mRNA tran»> 



scripts as compared to the proteins. Considering that the cur- 
rent 2D PAGE technology depicts mainly the marc abundant 
proteins, it would seem possible that most of the changes 
affecting protein levels may involve low abundance polypep- 
tides. ' 

&. Gens tinmstou proffliag frffmtqnrs: perspectives 

Novel and powerful techniques arc aow available to analyze 
the global gens caprcssioo patterns of cultured octts and tis- 
sues o burned from normal aad dhxased subjects. Each of 
these technologies has its awn advantages and limitations, 
hat ia combination they should provide us with o detailed 
^ expression phenotype at both the transcription and 
tsanshtioB level A major chaUcngt us the sear future will 
he to define a base toe for the normal gens enprensiea phe- 
natype of a given ceO type, tissue or body i^dd. TUs b not a 
trivial task, however, as It wOl require the analysis of hun- 
dreds or even thousands of samples. 

Berrdes unpiuvcmcats on the individual techalques then* 
selves (see above), there are etui major bUtatfana that mast 
be ecMrcased before these technologies can provide cho 
capecttd outcome in molecular medicinr. These snohtde: 
01 tirhrrirnl amsVIeais associated with the analysis of enpres- 
tlen profilo derived from tbrues that are cornposed of cfiftcr- 
cut ceO types, (a) tact of iu o cidui es for Identifying targets 
thai lie la the pathway of cfiaease, and (us) aad for bmiafor* ' 
rustics toob for rapidly ■«tH ,i 8 the function of the putative 
targets. The latter, b of paramount importance to ths pher- 
auccutical mdustry as the idsnthratiaa of disease deregulated 
targets alone b not sufficient to start a costly drug screening 
process* 

As far as thiuc rtetefogeaeity b concerned, the recent ad- 
vent of LCM holds great promise as with tbb technique it b 
possible to iso late rprrifir populations of oeOs from a tissue 
section using curact microseopie observation (if]. However, 
even though the technique has bam used for UNA mnarytb 
it b stiQ net ready for canst pro tsomk ptojaets as the number 
of oetts that con be obtained b too emal) to srtifrale reason- ' 
a We protein profiles la terms of the number of proteins that 
can be vbuaxacd (141). 

As DNA asiaoarraye and pxotcomica geuttalc asorc chua la 
the future it will become n matter of priority to develop snspte 
and rapid strategics to vafidsta the. vast amount of informa- 
tion that wul be genera ted, particularly in tissue Mopsica. This 
we behave can be aococapliibcd in part by snaking use of 
spccBe antibodies in combination triAassninnohlsioc 
(1.12). At present, them b no technology at hand that may 
allow us to prepare antibodies at will, although phage anuV . 
body libraries |I4) show much promise. 

Finally,, we would tike to emphasise that biology In tab 
MiSennium wiO be characterized by the study of corn pica 
biological phenomena. DNA rnJcroarraya and protcomics 
are just some of the technologies of f un ct i o na l genomics, 
and only their integration may oDow us to tocttfc the great 
edmplcaity underlying biological processes. 

Adn mkdiv r uA U: We woaU like to thank H Wolf for many rtinv 
olsting dise&ssions. Tbb work was supported by grants from the 
Danfah Cancer Society, tht Danish tKotecaaologr Progr amm e, toe 
lolm and Biitbc Meyer FouDiatlea and ths Mctaanr OrTonteJogy 
Centra. 
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Amplification and overexpression of putative oncogenes 
confer growth advantages for tumor development. We 
used a functional genomic approach that integrated 
simultaneous genomic and transcript microarray, proteo- 
mics, and tissue microarray analyses to directly identify 
putative oncogenes in lung adenocarcinoma. We first 
identified 183 genes with increases in both genomic copy 
number and transcript in six lung adenocarcinoma cell 
lines. Next, we used two-dimensional polyacrylamide gel 
electrophoresis and mass spectrometry to identify 42 
proteins that were overexpressed in the cancer cells 
relative to normal cells. Comparing the 183 genes with 
the 42 proteins, we identified four genes - PRDXI, 
EEF1A2, CALR, and KCIP-I - in which elevated protein 
expression correlated with both increased DNA copy 
number and increased transcript levels (all r > 0.84, two- 
sided P< 0.05). These findings were validated by South- 
ern, Northern, and Western blotting. Specific inhibition of 
EEF1A2 and KCIP-1 expression with siRNA in the four 
cell lines tested suppressed proliferation and induced 
apoptosis. Parallel fluorescence in situ hybridization and 
immunohistochemical analyses of EEF1A2 and KCIP-l in 
tissue microarrays from patients with lung adenocarcinoma 
showed that gene amplification was associated with high 
protein expression for both genes and that protein 
overexpression was related to tumor grade, disease stage, 
Ki-67 expression, and a shorter survival of patients. The 
amplification of EEF1A2 and KCIP-I and the presence of 
overexpressed protein in tumor samples strongly suggest 
that these genes could be oncogenes and hence potential 
targets for diagnosis and therapy in lung adenocarcinoma. 
Oncogene (2006) 25, 2628-2635. doi:10.1038/sj.onc. 1209289; 
published online 1 2 December 2005 
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Introduction 

In lung adenocarcinoma, as in other types of cancer, 
gene amplification and the consequent overexpression of 
the amplified oncogene play an important role in the 
development of tumors, because their overexpression 
confers a growth advantage. The ability to identify 
putative oncogenes that are activated during tumorigen- 
esis could facilitate the choice of molecular genetic 
targets for diagnosis and therapy of the disease. This 
concept has been exemplified by HER-2, which was first 
found to be amplified in neuroblastomas and subse- 
quently shown to be associated with poor prognosis in 
breast cancer (Ross and Fletcher, 1999). Now, HER-2 
aberrations are used as a predictor of response to 
therapy, and treatment of HER-2-positive breast cancer 
with the monoclonal anti-HER-2 antibody trastuzumab 
has been shown to improve prognosis (Ross and 
Fletcher, 1999). Emerging evidence of common ampli- 
cons in lung adenocarcinomas (Luk et aL, 2001; Jiang 
et al, 2004; Tonon et al., 2005) suggests that additional 
oncogenes remain to be identified; however, conven- 
tional techniques are ineffective in pinpointing such 
oncogenes. Parallel measurement of DNA copy number 
and mRNA levels in cDNA microarrays permits 
changes in copy number to be compared with transcrip- 
tion levels on a gene-by-gene basis to generate lists of 
candidate genes within the defining amplicons (Hyman 
et al, 2002; Pollack et aL, 2002). However, use of 
transcript patterns does not allow assessment of the 
expression of protein products or identification of proto- 
oncogenes. Another approach, identifying differentially 
expressed proteins by proteomic analysis and then 
comparing the proteins present with mRNA expression 
in cDNA microarrays from the same specimens, can 
clarify the extent to which changes in transcript patterns 
reflect changes in their cognate proteins and post- 
transcriptional mechanisms (Chen et al., 2002), but this 
approach cannot be used to identify oncogenes driven 
by extensive increases of their gene copy number. 
Moreover, using individual microarrays or proteomic 
approaches alone cannot distinguish the cancer-driving 
oncogenes that directly propel tumor progression from 
the larger number of passenger genes that may be 
concurrently over-represented but are not biologically 
relevant in tumor development. 
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In this study, we used a comprehensive approach that 
integrated simultaneous comparative genomic hybridi- 
zation (CGH) and transcript microarray with proteomic 
analyses of six lung adenocarcinoma cell lines. We 
directly and specifically identified four putative onco- 
genes that could have been activated through amplifica- 
tion and consequent elevation of transcript expression. 
We used small interfering RNA (siRNA) to inhibit the 
expression of two of these four genes in the lung cancer 
cell lines, which further implicated them in oncogenesis. 
We then explored the clinical significance of these 
findings by assessing the expression of these two genes 
in tissue microarrays of human lung cancer specimens. 
Our findings underscore the power of integrated 
functional genomic analyses for identifying putative 
oncogenes in tumorigenesis; such activated genes could 
be useful as targets for diagnosis or therapy in lung 
cancer. 



Results 

Simultaneous global genomic and transcript analyses 
identify 183 genes with increases in genomic copy 
numbers and transcript expression levels 
To identify genes in which increased DNA copy number 
might contribute to increased transcript in lung adeno- 
carcinomas, first we used CGH with microarrays of six 
lung adenocarcinoma cell lines. We identified 587 genes 
showing increases in DNA copy number across all six 
cell lines (Supplementary Table IS), which were 
distributed as 90 amplicons on all chromosomes except 
for chromosomes 13 and Y (Supplementary Table 2S). 
A subsequent transcript test with the identical arrays of 
the same cell lines revealed 275 genes that showed 
increased mRNA levels (Supplementary Table 3S). 
Using random permutation tests across all cancer cell 
lines, we identified 183 genes (31%) that showed 
elevated transcript levels from the 587 genes that were 
over- rep resented in the genome (Table 1), suggesting 
that elevated transcript levels of the 183 genes may 
reflect their genomic over-representation in the cancer 
cells. These findings are consistent with previous reports 
linking genomic changes with altered transcript patterns 
in breast cancer (Hyman et aL, 2002; Pollack et al^ 
2002). However, our finding that only 31% of the genes 
showing increased DNA copy numbers had cognate 
increases in transcript expression in lung adenocarcino- 
mas is different from the overall rates of 40-60% 
reported for breast cancer (Hyman et al. y 2002; Pollack 
et al., 2002). This discordance may reflect methodologic 
differences between studies or biological differences 
between breast cancer and lung adenocarcinoma. 

Proteomic analyses identify four genes for which protein 
abundance was associated with increases in the cognate 
gene and transcript levels 

Analysis of transcript patterns is insufficient for under- 
standing the expression of protein products and the 
effect of genomic over-representation on the expression 
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of their cognate proteins. To extend these findings 
beyond genomic over-representation to expression of 
the protein products of those genes, we next assessed 
protein expression in the same cell lines by two- 
dimensional polyacrylamide gel electrophoresis (PAGE) 
and found that 42 different proteins, representing 42 
individual genes, were significantly increased in the 
cancer cell lines (Table 2; Supplementary Figures IS and 
2S). Some of these proteins were identified as having 
multiple isoforms, and all individual isoforms exhibited 
increases in expression ranging from 4.6 to 12.8 times 
their expression in normal lung tissue cells. In compar- 
ing protein level of the 42 genes with changes in their 
cognate genomic and mRNA expression from the global 
microarray analyses, we found that four (9,5%) of those 
42 genes - PRDXf EEF1A2, CALR, and KCIP-I - 
showed statistically significant correlations between 
elevated protein expression and increases in both copy 
number and mRNA expression (all r>0.84; /><0.05) 
(Table 2) in the cancer cell lines. These findings imply 
that the abundance of these four proteins is attributable 
to the amplification and consequent elevated transcrip- 
tion of their cognate genes. 



Validation of copy number, transcript, and protein 
expression of PRDX1, EEFJA2, CALR, and KC1P-1 
in lung cancer cell lines 

To confirm our findings from the high-throughput 
analyses, we next used Southern, Northern, and Western 
blotting to assess DNA, RNA, and protein levels for the 
four genes identified in the six cell lines. For compar- 
ison, we arbitrarily chose one gene, NFKB1, in which an 
increase in protein level did not correlate with genetic 
changes. Overall, we found excellent concordance 
between the CGH microarray and Southern blotting 
analyses, transcript array and Northern blotting ana- 
lyses, and proteomic and Western blotting analyses for 
all five genes (Figure 1). For example, KCIP-1 showed 
fivefold amplification in five of the six cancer cell lines, 
whereas NFKB1 showed no such increase in any of the 
cell lines. As for transcript expression, Northern blotting 
of EEF1A2 showed high expression in five of the six 
cancer cell lines; again, levels of NFKBl transcript were 
not increased in any cancer cell line as compared with 
normal bronchial epithelial cells. The results of Western 
blotting were also consistent with the results of the 
proteomic experiments; for example, five of the cancer 
cell lines exhibited strong protein bands for PRDXI as 
compared with normal cells. These findings provide 
strong support for the validity of the results derived 
from the high-throughput techniques in this study. 

These parallel analyses also revealed close correla- 
tions in the extent of changes in gene copies, transcript, 
and protein of each of the four genes in the cancer cell 
lines. For example, in the five cancer ceil lines that 
showed at least fourfold increases in EEF1A2 copy 
number, expression of transcript and protein was also 
increased by at least a factor of four as well (relative to 
their expression in normal cells) (Supplementary Figure 
3S). The protein abundance of the four genes showing 
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Table 1 List of 183 genes with statistically significant correlation Table 1 {continued) 

(0.05) between genomic copy number and transcript level ______ 

— : Gene symbol Chro. Distance from p arm of each a 

Gene symbol Chro. Distance from p arm of each a chromosome (Mb) 
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Gene symbol 
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corresponding increases in both DNA copy number and 
mRNA provides further evidence that these could be 
oncogenes, the activation of which is reflected by 
genomic amplification and consequent increases in 
transcript level in lung adenocarcinoma cell lines. 

Specific inhibition of EEF1A2 and KCIP-J expression by 
si ft N As led to decreased cell proliferation and induction of 
apoptosis 

To further prove the oncogenic function of the identified 
genes in lung tumorigenesis, we used siRNAs to inhibit 
the endogenous expression of EEF1A2 and KCIP-1 
protein in four lung cancer cell lines (HI 563, H229, 
H522, and SK-LU). Transfection of the cancer cells with 
specific siRNAs reduced the level of EEF1A2 and 
KCIP-1 protein by 70-90% 48 h after transfection 



(Supplementary Figure 4S). In contrast, EEF1A2 and 
KCIP-1 protein levels remained unchanged in mock- 
treated control cells and in cells transfected with a 
scrambled siRNA sequence. At 48 h after siRNA 
transfection, the percentage of proliferation of the 
transfected cancer cells was reduced to 15-30% as 
compared with 91-100% of cell proliferation of the 
same cell lines treated with PBS or scrambled siRNA 
(Supplementary Figure 5S). Apoptosis of siRNA- 
transfected cells was 27-34%, whereas only 4% of the 
same cell lines treated with PBS or scrambled siRNA 
showed apoptosis. These results strongly support an 
oncogenic role for the identified genes in lung cancer and 
confirm their potential usefulness as therapeutic targets 
for the disease. 



Amplification and protein expression of KCfP-l and 
EEF1A2 in lung tissue 

To further validate these findings and to assess the 
possible clinical significance of the four potential 
putative oncogenes identified from the cell lines, we first 
applied fluorescence in situ hybridization and immuno- 
histochemical analysis, in parallel, to commercially 
available human lung tissue microarrays (Ambion, 
Austin, TX, USA) to evaluate the status of two of these 
four genes in lung cancer tissue specimens. (Commer- 
cially available antibodies to PRDX1 or CALR were 
not suitable for use in immunohistochemical analysis 
when this report was written.) Overexpression of KCIP- 
1 and EEF1A2 protein in the tumors was concordant 
with amplification of the corresponding genes 
(/> = 0.0003 for KCIP-I and / > = 0.0011 for EEF1A2). 
For example, 16 (35%) of the 46 lung adenocarcinomas 
in the microarray showed amplification of KCfP-l, and 
strong cytoplasmic staining for KCIP-1 protein was seen 
in 18 tumors (39%) (Figure 2). We next examined 
whether overexpression of these genes was associated 
with increased cell proliferation by analysing Ki-67 
expression in contiguous sections of the tissue micro- 
arrays. Positive Ki-67 expression was found to correlate 
with positive expression of both KCIP-1 (P = 0.02) and 
EEF1A2 (P = 0.0I). To extend these findings, we then 
studied 1 1 tissue microarray blocks comprising normal 
and tumor tissue specimens from 113 patients with 
pathologic stage ! non-small-cell lung cancer who had 
undergone curative surgery (Wang et aL, 2005). 
Immunohistochemical analysis showed that EEF1A2 
was expressed in 32 cases (28%) and KCIP-1 in 29 cases 
(26%). Univariate and multivariate Cox proportional 
hazards models were used to detect possible associations 
between EEFIA2 and KCIP-1 expression and clinico- 
pathologic variables. Expression of EEF1 A2 or . KCIP-1 
was associated with short overall survival time 
(P = 0.00 1 2 for EEF1A2 and /> = 0.0026 for KCIP-1) 
(Supplementary Figure 6S). Age at diagnosis, histologic 
type of cancer, degree of tumor differentiation, and 
smoking history were not associated with survival time. 

Although only two genes were validated in the lung 
tissue microarrays (because available antibodies to 
the other two genes were not suitable for use in 
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Table 2 Proteins showing significant overexpression in cancer cell lines relative to those in normal bronchial epithelial cell lines and their 

correlation coefficients with increased DNA copy number or mRNA values 0 



Acc. HO. 


Gene ID 


Gene 


Mw/p\ 


Description 


r with geno- 
mic copy 
changes* 


r with mRNA 
changes* 


VUuo 31) 




r KDX 1 


48.4/5.4 


Peroxiredoxin 1 


0.92364 


0.91892 




ion 


Lbr 1 A2 


50.5/5.7 


Eukaryotic translation elongation factor 1 alpha 2 


0.90218 


0.89456 


rl I ly I 


81 1 


CALR 


61/5.5 


Calrcticulin 


0.84128 


0.86434 


D/ll 1 f\A 

loJ 1U4 


7534 


KCIP-1 


27/6.5 


Tyrosine 3-monooxygenase activation protein, zeta 


0.84467 


0.85499 


P07237 


5034 


P4HB 


54/6.2 


Procollagen-proline, 2-oxoglutarate 4-dioxygenase 


0.91884 


0.76786 


r\f\ a ccxc 

QU4695 


3872 


KRT17 


48.0/4.9 


Keratin 17 


0.00236 


0.86892 


r092 I 1 


2950 


GSTP1 


23.2/4.7 


Glutathione 5-transferase pi 


0.84218 


0.69456 


D 1 

r 1 /yjo 


3486 


IGFBP-3 


31.6/5.8 


Insultn-like growth-factor binding protein 3 


0.06412 


0.16434 


P2664 1 


1937 


EEFIG 


50/6.4 


Eukaryotic translation elongation factor 1 gamma 


0.00446 


0.85549 


P08727 


3880 


KRTI9 


44.1/5.2 


Keratin 19 


-0.04884 


0.86786 


DA/1 TOO 


3315 


HSPBI 


22/6.5 


Heat shock 27 kDa protein 1 


0.00364 


0.31892 


r*/\/\ ceo 

P00558 


5230 


PGK1 


44.5/4.2 


Phosphoglycerate kinase 1 


0.50402 


0.79456 


AA 1 Art C 

Q01995 


5876 


TAGLN 


22.5/4.3 


Transgclin 


-0.34128 


-0.26434 


P0863 1 


3055 


JTK9 


59.5/6.8 


Hemopoietic cell kinase 


-0.01446 


0.02549 


DAD 101 


3956 


LGALSl 


16/5.5 


Galectin-I, galactoside-binding, soluble, 1 


0.026623 


0.01 123 


t^9z /34 


8110 


DPF3 


25.8/4.8 


D4, zinc and double PHD fingers, family 3 


0.094884 


-0.03214 


P54257 


9001 


HAPI 


75.5/6.5 


Huntington-associatcd protein 1 


0.12364 


-0.08108 


DA c -701 


3875 


KRTI8 


48/5.3 


Keratin 18 


0.010218 


0.60544 


rllj /is / 


Jo JO 


KRT8 


9.2/4.4 


Keratin 8 


0.041280 


0.84566 


rWtl / .IIS 


3240 


HP 


55.2/6.2 


Haptoglobin 


0.044679 


-0.14501 


rU9769 


2268 


FGR 


59.5/5.2 


Gardner-Rasheed feline sarcoma viral oncogene homolog 


0.031264 


-0.13789 




4790 


NFKBI 


50.4/6.3 


Nuclear factor of kappa light gene enhancer in B-cetls 1 


0.04467 


-0.14501 


DTOfil A 


6273 


M00A2 


10.9/4.6 


S100 calcium-binding protein A2 


0.87964 


0.243214 


^ 1 j IUj 


I /U9 


ZB 1 Bl / 


87.9/5.3 


Zinc-finger and BTB domain containing 17 


-0.17636 


0.048108 


yuuv ts / 


4193 


MDM2 


75.2/4.8 


Transformed 3T3 cell double minute 2 


-0.19782 


•-0.50544 


P27816 


4134 


MAP4 


1 1 1 '5.4 


Microtubule-associated protein 4 


0.25872 


-0.05356 


P52732 


3832 


KIF1I 


119.2/6.2 


Kincsin family member 1 1 


-0.25778 


-0.53444 


P25205 


4172 


MCM3 


. 90.9/5.5 


Minichromosome maintenance deficient 3 


0.25644 


0.053666 


P0863 1 


3055 


HCK 


59.5/5.7 


Hemopoietic cell kinase 


0.65533 


0.054501 




43 16 


MMP7 


..22.6/5.8 


Matrix mctalloprotcinasc 7 


0.234987 


0.876820 


P30305 


994 


CDC25B 


64.9/4.5 


Cell division cycle 25B 


0.045116 


0.283214 


P50290 


998 


CDC42 


21. 3/6.1 


Cell division cycle 42 (GTP-binding protein, 25 kDa) 


-0.47636 


0.088108 


P61586 


387 


RHOA 


19.8/6.9 


Ras homolog gene family, member A 


-0.49782 


-0.00544 


P63000 


5879 


RACI 


21.5/6.8 


Ras-related C3 botulinum toxin substrate 1 


-0.05583 


-0.03566 


P07437 


203068 


TUBB 


49.6/6.5 


Tubulin, beta polypeptide 


0.255533 


0.145010 


P24864 


898 


CCNE1 


47.1/4.3 


Cyclin El 


- -0.65116 


0.232149 


P04141 


1437 


CSF2 


16.9/6.3 


Colony stimulating factor 2 (granulocyte-macrophage) 


-0.64636 


-1.28108 


P28072 


5694 


PSMB6 


25.3/5.2 


Proteasome (prosome, macropain) subunit, beta type, 6 


-0.69782 


-1.30544 


P00352 


216 


ALD- 
HIA1 


54.7/4.3 


Aldehyde dehydrogenase 1 family, member Al 


-0.75872 


0.03356 


Q03013 


2948 


GTM4 


25.3/5.0 


Glutathione 5-transferase M4 


. -0.78533 


0.134501 


P63241 


1984 


EIF5A 


10/4.4 


Eukaryotic translation initiation factor 5A 


-0.97893 


-1.44321 


Q01469 


2171 


EFABP 


18.0/4.2 


Fatty acid-binding protein 5 


0.25684 


-0.36432 



'Only the gene showing statistically significant increased protein expression with increases in both genomic copy number and transcript 
simultaneously will be considered as potential putative oncogene in lung adenocarcinoma cells. V, Spearman correlation coefficients between 
proteins and genomic or mRNA values are based on all six cancer cell lines; bold indicates /*<0.05, if r> 0.84000. Mw, molecular weight; p/, 
isoelectric point. 



immunohistochemical analysis), these findings are con- 
sistent with those from our cell lines, demonstrating 
again that genomic amplification and consequent 
increases in amounts of transcript may be, at least in 
part, driving the abundance of proteins in these lung 
tumors. The association between expression of these 
genes and that of Ki-67, a known indicator of poor 
prognosis in lung cancer (Martin et al., 2004), suggests 
that activation of these genes may be an indicator of 
tumor aggressiveness. These results also suggest that 
expression of EEF1A2 and KCIP-1 proteins in stage I 
non-small-cell lung cancer may be useful as a marker for 
distinguishing patients with relatively poor prognosis 
from those who might benefit from adjuvant treatment. 



Discussion 

Our current study illustrates the power of integrated 
functional genomic analyses for identifying putative 
oncogenes and for evaluating their potential clinical 
significance. Among the four identified oncogenes, three 
genes (PRDX1, CALR, and KCIP-1) have been im- 
plicated in lung tumorigenesis. PRDX1 is an antioxidant 
protein involved in regulating cell proliferation, differ- 
entiation, and apoptosis. Kim et al. (2003) found 
PRDX1 expression to be elevated in both, lung cancer 
and adjacent normal lung tissue, suggesting that 
activation of PRDX1 may enhance proliferation in lung 
cancer. CALR has a major role in Ca 2+ binding and the 
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CALR 



KCIP-1 



NFKB1 



Figure 1 Confirmation by Southern, Northern, and Western blot 
analyses of increased DNA copies, transcript levels, and protein 
levels in the four genes identified in high-throughput analyses. For 
comparison, wc arbitrarily chose one gene, NFKB1, in which an 
increased protein level did not correlate with genetic changes. The 
blotting results are consistent with the results from the CGH array, 
transcript array, and proteomic analyses. Nor, indicates normal 
bronchial epithelial cell line. All the experiments were repeated at 
least three limes with each cell line. Means of normalized to 0-actin 
signal intensities on Southern, Northern, and Western blots, along 
with 95% confidence intervals, were calculated (0-actin signals are 
not shown in the figure; two different normal bronchial epithelial 
cell lines were used in the confirmation and only one normal cell 
line is shown in the figure). 



transcriptional regulation of other genes and was 
recently found to be overexpressed in 73% of 40 lung 
adenocarcinomas (Oates and Edwards, 2000). KCIP-1 
belongs to the 14-3-3 family, which participates via the 
MAPK and Wnt signaling pathways in the regulation of 
many cellular processes including cell proliferation and 
differentiation as well as tumorigenesis (Thomas et al., 
2005). KCIP-1 was recently found to be expressed in all 
12 lung tumors tested in a single-institution study (Qi 
et al., 2005). Interestingly, EEFIA2 was originally 
considered a putative oncogene in ovarian cancer on 
the basis of its being amplified in 25% and over- 
expressed in 30% of the same set of ovarian tumors 
(Anand et al., 2002); functional analyses have estab- 
lished its oncogenic role in cellular transformation (Lee, 
2003). Our discovery that EEFIA2 may be a putative 
oncogene in lung adenocarcinoma demonstrates the 
power of our functional genomic strategy for rapidly 
identifying potential oncogenes. 

Although the main focus of this study was to 
specifically identify putative oncogenes, it should be 



noted that 90.7% of the genes showing high protein 
expression did not show corresponding increases 
in both DNA copy number and transcript, a finding 
consistent with that of others that transcriptional, 
translational, and post-translational regulatory mecha- 
nisms can greatly influence the abundance of protein 
in lung tumorigenesis (Clien et al, 2002). For example, 
NFKBl is a critical arbiter of immune responses, 
cell survival, and transformation and is often activated 
in several types of tumors (Chen et al. % 2002). De- 
regulation of NFKBl is thought to be modulated 
through phosphorylation of Ser337 by protein kinase 
A (Chen et «/., 2002). In our study, 68.8% of the 
genes showing over-representation in the genome 
did not show elevated transcript levels, implying 
that at least some of these genes are 'passenger' genes 
that are concurrently amplified because of their 
location with respect to amplicons but lack bio- 
logical relevance in terms of the development of lung 
adenocarcinoma. 

Although the potential oncogenes we identified here 
are likely to be important, certainly other oncogenes 
could be involved in the development of lung adeno- 
carcinoma. The oligo microarray we used consists of 
22000 probes, which represent only about 60% of the 
human genome. Moreover, each probe was designed for 
the 3' region of expressed sequence tags of the selected 
genes. Also, our results were initially derived from 
cancer cell lines, although the findings were later 
confirmed in human tissue samples. Our ongoing study 
using microarrays with information on more genes 
and the development of high-resolution proteomic 
analyses for use with larger numbers of specimens will 
allow more comprehensive analyses of the molecular 
consequences of gene amplifications. Such expanded 
analyses will very likely lead to the identification of 
additional oncogenes. 

Some of the results of our current study were 
comparable to those of other studies of lung cancer. 
For example, genomic copy number and protein levels 
of KCIP-I were previously found to be amplified and 
overexpressed in primary lung cancers by cDNA clone- 
based CGH array analysis (Jiang et al., 2004) and 
proteomic analysis (Chen et al., 2002), respectively. Our 
functional genomic approach, which integrates simulta- 
neous CGH, transcript microarrys, proteomic analyses, 
and siRNA, allows us not only to quickly identify 
potential oncogenes but also to explore their significance 
as diagnostic and therapeutic targets in tumor progres- 
sion - more than could be achieved by any technique 
alone. 

Genes identified in this way may serve as promising 
targets for diagnosis and therapy in lung adenocarci- 
noma. Further research on the clinical implications of 
such genes is needed; experiments now underway in our 
laboratory include overexpression of the genes in 
normal cells, disruption of the function of these genes 
in cancer cells, and investigation of how interactions 
among these genes (or interactions with other known 
oncogenes) may mediate the expression of the trans- 
formed phenotype. 
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Figure 2 EEF1A2 amplification is associated with high EEFI A2 protein expression in lung adenocarcinomas, (a) Cells from a lung 
adenocarcinoma sample in which EEF1A2 is amplified show more green signals {EEFIA2) than red signals (chromosome 20 
centromeric probe) (original magnification, x 400). (b) Immunohistochemical staining of cells from the same tissue sample as in panel 
a shows strong EEFI A2 staining in the cytoplasm, (c) A lung adenocarcinoma sample with two copies of EEFI A 2 and chromosome 20 
centromeric probe, indicating no EEF1A2 amplification (original magnification, x 400). (d) Immunohistochemical staining of cells 
from the same tissue sample as in panel c shows negative staining for EEF1A2. 



Materials and methods 

Cell lines 

Six human lung adenocarcinoma cell lines (H23, H229, H 1 792, 
SK-LU-1, H522, and H1563) were obtained from the 
American Type Culture Collection (Manassas, VA, USA). 
Two normal bronchial epithelial cell lines were obtained from 
Clontech (Palo Alto, CA, USA). Genomic DNA, mRNA, and 
protein were derived from a single harvest of these cells. 

DNA and RNA profiles by microarray analysis 
Genomic DNA labeling and hybridization were performed as 
described previously (Barrett et al., 2004) with Agilent's 
Human 1A Oligo Microarray (V2) (Agilent Technologies, 
Palo Alto, CA, USA), which contains 22 000 unique 60-mer 
oligos. Details of the protocol for analysing transcripts are 
available at http://www.chem.agilent.com. Map positions for 
arrayed genes were assigned by identifying the DNA sequence 
represented in the UniGene cluster and matching it with the 
Golden Path genome assembly (http://genome.ucsc.edu/; Mat 
7, 2004 Freeze). Microarray images of DNA copy number and 
expression were analysed by using AgilentCGH Analytics and 
Feature Extraction software. DNA copy number profiles that 
deviated significantly from background signal ratios (measured 
from normal control cell hybridization, as described elsewhere; 
Barrett et al., 2004) were interpreted as evidence of true 
differences in DNA copy number. The criteria for defining 
genomic over-representation and amplicons are described 
elsewhere (Hyman et al., 2002); details are given in the 



Supplementary Information. An increase in mRNA level was 
defined as a twofold increase in signal ratio relative to that of 
the control (log 2 > 1). 

Quantitative two-dimensional PAGE and mass spectrometry 
Analysis of proteins by two-dimensional PAGE and their 
identification by mass spectrometry were performed as 
previously described (Shen et aL, 2004). Briefly, protein pellets 
were solubilized in rehydration buffer, after which the first- 
dimension isoelectric focusing was carried out with a Protean 
IEF Cell (Bio-Rad Laboratories) and the second-dimension 
separation was carried out with Bio-Rad's Ready Gel Precast 
Gels and the Bio-Rad Criterion Cell apparatus. Protein spots 
were visualized by silver-based staining, and all gels were 
assessed with Bio-Rad's PDQuest 2D gel image analysis 
software. Selected spots were subjected to in-gel tryptic 
digestion and analysed on a Voyager-DE PRO matrix-assisted 
laser desorption ionization/time-of-flight mass spectrometer 
(Applied Biosystems, Foster City, CA, USA). The mass list of 
the 20 most intense monoisotopic peaks for each sample was 
entered in the MS-Fit search program (v3.2.1) (http:// 
prospector. ucsf.edu/ucsfhtml4.0/msfit. htm) and searched in 
the National Center for Biotechnology Information protein 
database. 

Southern, Northern, and Western blot analyses 
Southern, Northern, and Western blot hybridizations were 
performed according to standard protocols. cDNA clones for 
the tested genes were purchased from Invitrogen (Carlsbad, 
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CA, USA) and prepared as probes for the blot hybridizations. 
Antibodies used were obtained as follows: PRDXI, CALR, 
NFKB1, KCIP-1, and 0-actin from Santa Cruz Biotechnology 
(Santa Cruz, CA, USA); and EEF1 A2 from Upstate Biotech- 
nology (Waltham, MA, USA). 

Fluorescence in situ hybridization and immunohistochemical 
analyses of lung tissue microarrays 

Fluorescence in situ hybridizations and immunohistochemical 
analyses of KCIP-1 and EEF1 A2 were carried out as described 
elsewhere (Jiang et <//., 2002; Wang et 2005) with Lung 
Tissue Microarrays (Ambion, Austin, TX, USA) and 11 
homemade microarray blocks containing tissue samples from 
1 13 patients with pathologic stage I non-small-cell lung cancer 
(Wang et a/., 2005). DNA probes specific for KCfP-I and 
EEFIA2 were obtained by screening a Human BAC Clone 
library (Invitrogen) by polymerase chain reaction as described 
previously (Jiang et al., 2002). The antibodies used for the 
immunohistochemical analyses were the same as those used 
for the Western blotting. Cell proliferation of the lung tissues 
was assessed* with a Ki-67 monoclonal antibody from Santa 
Cruz Biotechnology. Definitions of the cutoff value for a 
positive result of each antibody are shown in Supplementary 
Information. 

siRNA transfection. cellular proliferation assay, and apoptosis 
analysis 

Transfections were carried out by using siPORT Lipid 
Transfection Agent (Ambion) with siRNAs targeting KCIP-1 
or EEF1A2 or with a scrambled siRNA duplex (siControl) 
(Dharmacon Inc., Lafayette, CO, USA), with PBS used as a 
negative control (Jiang et «/., 2002). Cells were fixed 24, 48, or 
96 h later and subjected to further tests. All siRNAs were 
prepared by using a transcription-based method with Silencer 
siRNA according to the manufacturer's instructions (Am- 
bion). Sequences of the individual siRNAs are listed in 
Supplementary Table 4S. Inhibition of cell growth by the 
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siRNAs was determined by MTT staining, and cell growth 
rate was plotted against the percentage of viable cells in the 
saline-treated controls (a value arbitrarily set at 100%) (Jiang 
et al. y 2002). Apoptosis was analysed by fluorescence cell 
cycle analysis of terminal deoxynucleotidyl transferase- 
mediated dUTP nick-end labeling with FITC-labeled dUTP 
(Boehringer Mannheim Biochemicals, Mannheim, Germany) 
(Jiang et al., 2005). 

Statistical analyses 

Relationships between gene copy number and mRNA level 
were examined as described elsewhere (Hyman et al., 2002, 
Supplementary Information). Correlations between, protein 
abundance and DNA copy number and mRNA expression of 
the corresponding genes were evaluated with the Spearman 
correlation coefficient. Fisher's exact test and * : -tests wer e 
used to analyse associations between amplification and 
expression of the candidate genes with various histopathologic 
variables of the samples in the tissue microarrays. Univariate 
and multivariate analyses were carried out with Cox's 
proportional hazards model to determine which independent 
factors might have a joint significant influence on survival. A 
P-value ^0.05 was considered statistically significant; all 
statistical tests were based on a two-sided significance level. 
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